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NOTATION  AND  ABBREVIATIONS 


SPE  stands  for  Steiner  problem  in  Euclidean  space. 
RSP  stands  for  rectilinear  Steiner  problem. 

SPG  stands  for  Steiner  problem  in  graphs. 

SPDG  stands  for  Steiner  problem  in  directed  graphs. 
SMT  stands  for  Steiner  minimum  tree. 

MST  stands  for  minimum  spanning  tree. 

ST  stands  for  Steiner  tree. 

FST  stands  for  fuU  Steiner  tree. 

M denotes  a set  of  nodes. 

A denotes  a set  of  arcs. 

C denotes  a set  of  arc  costs. 

denotes  a set  of  directed  arcs, 
denotes  a set  of  costs  associated  with  directed  arcs. 
TZ  denotes  a set  of  regular  nodes. 

S denotes  a set  of  Steiner  nodes. 

G denotes  a graph. 
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denotes  a directed  graph. 

0{%)  denotes  the  subset  of  arcs  that  are  departing  from  node  i. 
J(i)  denotes  the  subset  of  arcs  that  are  ending  at  node  i. 
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The  Steiner  problem  in  graphs  (SPG)  has  become  of  great  importance  due  to  its  many 
applications  in  areas  like  manufacturing  systems  and  communication  network  design.  As 
crucial  as  it  is,  the  problem  has  been  studied  over  the  last  few  decades  by  researchers 
challenged  by  its  difficulty.  Indeed,  several  heuristics  have  been  proposed  to  deal  with  the 
intractable  aspect  of  the  problem,  and  many  exact  algorithms  have  been  devised  to  take 
advantage  of  special  graph  structures;  a survey  of  the  state  of  the  art  is  given. 

In  order  to  tackle  the  SPG,  we  propose  three  new  formulations,  a heuristic,  an  exact 
algorithm,  and  the  first  test-problem  generator.  The  three  proposed  formulations  include 
two  integer  formulations  and  a continuous  formulation  which  is  the  first  of  its  kind  opening 
the  horizon  of  treating  the  SPG  as  a continuous  combinatorial  problem.  The  proposed 
heuristic  is  based  on  assigning  preference  weights  to  arcs  in  order  to  break  ties  among 
alternative  choices  that  occur  when  applying  an  SPG  heuristic  based  on  Prim’s  algorithm 

for  the  minimum  spanning  tree.  Using  one  of  the  two  proposed  integer  formulations,  namely 

• • 
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the  rooted-tree  formulation,  and  the  proposed  heuristic  to  generate  lower  bounds  and  a 
starting  upper  bound  respectively,  the  proposed  algorithm  is  based  on  a branch-and-bound 
scheme.  The  generator  uses  optimality  conditions  derived  from  the  rooted-tree  formulation 
to  construct  test  problems  with  known  optimal  solutions  in  linear  time.  Experimental 
results  are  reported,  and  some  suggestions  for  future  research  are  made. 
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CHAPTER  1 


INTRODUCTION 

Among  the  oldest  of  optimization  problems  in  mathematics,  the  Steiner  problem  finds 
its  roots  back  in  the  seventeenth  century  when  the  French  mathematician  Pierre  Fermat 
(1601-1665)  closed  his  celebrated  work  on  minima  and  maxima  [54]  by  posing  the  following 
problem  : 

Etant  donnes  trois  points,  en  trouver  un  quatrieme  tel  que  la  somme  de  ses 
distances  aux  trois  points  donnes  soit  minima. 

Already,  around  1640  Evangelista  Torricelli  and  Francisco  Cavalieri,  independently,  solved 
this  problem  of  finding  a fourth  point  such  that  the  sum  of  its  distances  to  three  given 
points  is  minimum.  In  their  geometric  solution,  the  fourth  point  has  to  be  chosen  in  such 
a way  that  joining  it  to  the  three  given  points  creates  three  angles  of  exactly  120  degrees 
(assuming  all  angles  in  the  triangle  induced  by  the  given  points  are  less  than  120  degrees). 
This  simple  but  instructive  problem  was  rediscovered,  over  the  centuries,  and  treated  by 
Jacob  Steiner,  the  famous  representative  of  geometry  at  the  University  of  Berlin  in  the 
early  nineteenth  century.  The  problem  treated  by  Steiner  was  to  find  a system  of  roads, 
of  minimum  total  length,  which  would  connect  three  given  villages  (see  Figure  1.1).  Later, 
in  the  early  twentieth  century,  this  problem  of  connecting  three  given  points  in  the  plane 
with  the  minimum  total  length  was  generahzed  to  connecting  N given  points  with  the 
minimum  total  length  by  Jarnik  and  Kossler  [82].  Although  this  paper  by  Jarnik  and 
Kossler  is  considered  to  be  the  first  paper  to  define  the  genuine  Steiner  problem  in  the 
Euclidean  plane  (SPE),  the  real  awakening  of  interest  in  the  SPE  is  due  to  its  inclusion  in 
the  popular  book  What  Is  Mathematics?  by  Courant  and  Robbins  [31]  in  1941,  who  also 
attached  the  name  Steiner  to  the  problem.  Originally  defined  for  the  Euclidean  distance 
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Figure  1.1.  Euclidean  Steiner  problem. 

metric,  the  Steiner  problem  was  later  generalized  to  include  other  problems  with  different 
types  of  distance  metric  and  (or)  different  connectivity  requirements.  Having  in  common 
the  feature  of  minimizing  the  total  length,  in  a specific  distance  metric,  to  meet  specific 
connectivity  requirements,  those  different  forms  of  the  Steiner  problem  wiU  be  introduced 
in  the  following  sections. 

The  organization  of  this  chapter  is  as  follows:  in  section  1.1,  we  introduce  the  Steiner 
problem  in  Euclidean  Space,  and  we  summarize  the  results  established,  so  far,  in  this  area. 
In  section  1.2,  the  rectilinear  Steiner  problem  is  discussed.  In  section  1.3,  we  introduce  the 
Steiner  problem  in  graphs  and  we  give  a survey  of  the  state  of  the  art  for  complexity  and  al- 
gorithm development  for  some  special  graph  structures.  Some  of  the  real-world  applications 
of  the  Steiner  problem  are  given  in  section  1.4  along  with  generalized  forms  of  the  Steiner 
problem  arising  in  real-world  situations.  In  section  1.5,  our  objectives  and  contributions 
are  summarized. 

1.1  The  Steiner  Problem  in  Euclidean  Space 

The  first  nontrivial  case  of  the  Steiner  problem  in  Euclidean  space  to  consider  is  the 
SPE.  The  SPE  can  be  defined  as  follows  [82].  Given  N regular  points  (7^-points),  in  the 
Euclidean  plane,  find  an  interconnection  network  linking  those  points  with  the  total  distance 
being  minimum.  Eventually,  this  network  has  the  properties  of  a Steiner  tree  (ST)  which 
are  given  below: 
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(i)  The  network  consists  of  a tree  T. 

(ii)  Any  two  edges  of  T meet  at  an  angle  of  at  least  120  degrees. 

(Hi)  A vertex  in  the  tree  that  is  not  one  of  the  given  points,  such  a vertex  is  called  a Steiner 
point  or  an  <S-point,  cannot  be  of  degree  one  or  two. 

Obviously,  (i)  and  (hi)  are  necessary  for  a shortest  interconnecting  network.  In  addition,  (ii) 
is  also  required  for  a shortest  interconnecting  network  using  an  argument  of  minimizing  the 
potential  of  forces  (edges)  [66].  Properties  (ii)  and  (hi)  imply  that  each  «S-point  is  of  degree 
exactly  three.  Properties  (i)  and  (hi)  imply  that  there  are  at  most  (N  — 2)  <S-points.  An 
ST  that  has  (A  — 2)  <S-points  is  full^  and  in  such  a tree,  denoted  by  FST,  every  given  point 
is  of  degree  one.  The  graph  structure,  P,  of  an  ST  is  caUed  a topology.  A full  topology 
(PP)  is  the  topology  for  some  FST.  By  a convexity  argument,  a topology  P can  have  at 
most  one  ST  which  would  correspond  to  the  shortest  interconnecting  network  for  P [66]. 

For  some  special  cases  treated  by  Jarnik  and  Kossler  [82],  Du  et  al.  [45],  Kotzig  [92], 
Weng  [164],  Du  et  al.  [43],  Provan  [122],  Chung  and  Graham  [23],  Du  and  Hwang  [41], 
Du  et  al.  [44],  Hwang  et  al.  [80],  Gardner  [59],  Chung  et  al.  [22],  Georgakopoulos  and 
Papadimitriou  [64],  and  Hwang  [77],  an  SPE  exact  solution  can  be  found  in  polynomial 
time.  While  the  SPE  is  tractable  for  some  instances,  it  is  shown  to  be  an  NP-hard  problem 
in  general  [60].  So,  it  is  unhkely  to  find  polynomial-time  algorithms  that  find  exact  solutions 
for  the  general  SPE.  Melzak  [105],  Cockayne  [27],  and  Winter  [168]  propose  exact  algorithms 
manipulating  full  topology  substructures  and  using  reduction  and  restriction  techniques  to 
reduce  the  number  of  cases  to  consider,  refer  to  Hwang  and  Richards  [78]  for  a survey. 
A new  approach  for  constructing  an  SPE,  however,  is  proposed  by  Trietsh  and  Hwang 
[154].  This  approach  would  compare  favorably  to  the  classical  approach  if  there  exists 
an  efficient  algorithm  to  find  the  ST  in  each  P(PP),  where  D{FP)  denotes  the  set  of 
topologies,  including  PP,  that  can  be  obtained  from  PP  by  sequentially  collapsing  .S-points 
into  adjacent  7^-points. 
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Interesting  enough  are  solution  methods  which  employ  techniques  that  use  natural  sys- 
tems’ tendencies  to  find  minimum  energy  configurations  to  solve  optimization  problems. 
Among  those  methods,  mechanical  solutions  for  the  SPE  by  using  pegs  and  elastic  bands 
or  soap  films  are  nicely  covered  by  Miehle  [106],  Courant  and  Robbins  [31],  Gallawa  [58], 
Clark  [26],  Polya  [117],  Gilbert  and  Poliak  [66],  and  Palermo  [111].  Also,  a solution  using 
a rubber  membrane  is  provided  by  Soukup  [146].  True,  these  methods  have  physical  sense, 
and  they  are  appeahng  for  intuition.  However,  from  the  mathematical  and  computational 
perspectives,  they  are  not  as  rigorous  as  mathematical  solutions. 

Not  to  be  overlooked,  methods  based  on  standard  minimization  techniques  can  be  em- 
ployed to  solve  for  the  optimal  locations  of  <S-points.  However,  the  equations  involved  are 
nonlinear  so  that  an  iterative  method  is  required  [106].  Recently,  Smith  [144]  gave  such 
a method  which  has  a geometric  convergence  rate.  Along  another  line,  a probabilistic 
approach  using  “simulated  anneahng”  is  proposed  by  Lundy  [99,  100].  In  the  same  line 
of  probabilistic  approaches,  Karp  [88]  gives  a partition  algorithm  for  the  salesman  problem 
that  could  be  modified  easily  to  fit  the  SPE,  and  the  algorithm  is  shown  to  be  a probabilistic 
c- approximation  scheme.^ 

To  deal  with  the  SPE  in  a tractable  fashion,  there  are  several  polynomial  time  heuris- 
tics to  approximate  a Steiner  minimum  tree  (SMT)  in  the  Euchdean  plane.  Approaches 
proposed  by  Chang  [19],  Thompson  [152],  Korhonen  [90],  Smith  and  Liebman  [141],  and 
Beasley  [11]  are  based  on  “local  Steinerizations,”  i.e.,  adding  an  5"-point  to  connect  three 
points  “close”  to  each  other  and  delete  an  edge  from  any  loops  formed.  Another  type  of 
approach  using  Voronoi  diagrams  and  Delauney  triangulations  is  proposed  by  Smith  et 
al.  [140].  Other  heuristic  approaches  that  run  in  strongly  polynomial  time  are  proposed  by 
Smith  [143],  Suzuki  and  Iri  [149],  and  Beasley  [9].  Compared  to  most  of  the  polynomial-time 

heuristics,  the  minimum  spanning  tree  (MST)  heuristic  is  the  most  popular.  Its  popularity 

probabilistic  c- approximation  scheme  is  defined  to  be  an  algorithm  that  constructs  a tree  T in  time 
polynomial  in  1/e  such  that  the  ratio  of  the  lengths  of  T and  the  minimum  Steiner  tree  is  bounded  by  1-f-e 
with  probability  one. 
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is  due  to  complexity  and  performance  issues.  In  fact,  the  best  MST-based  algorithm  is 
implemented  in  0{N  log  N)  as  shown  in  Shamos  and  Hoey  [136],  where  N is  the  number 
of  given  points  to  connect.  As  far  as  performance  is  concerned,  MST-based  algorithms  are 
competitive  with  other  known  algorithms  since  the  improvement  in  expected  length  using 
the  other  algorithms  is  only  about  3 or  4 percent,  and  there  is  no  worst-case  improvement 
proven.  The  question  of  worst-case  performance  of  MST-based  heuristics  has  caused  much 
excitement  among  the  operations  research  community.  Gilbert  and  Poliak  [66]  conjectured 
in  1968  that  in  the  Euclidean  plane,  the  performance  ratio  p,  defined  as  the  ratio  of  the 
lengths  of  the  SMT  and  the  MST,  is  larger  than  or  equal  to  ^ for  any  number  N of  given 
points,  and  they  verified  the  conjecture  for  N = 3.  The  conjecture  was  then  verified  by 
PoUak  [116]  for  iV  — 4,  by  Du  et  al.  [46]  for  iV  = 5 and  by  Rubinstein  and  Thomas  [132]  for 
N = 6.  Along  another  line,  the  lower  bound  on  the  performance  ratio  p was  pushed  from 
0.5  by  Moore  as  reported  in  Gilbert  and  Poliak  [66]  to  0.57  by  Graham  and  Hwang  [68], 
to  0.74  by  Chung  and  Hwang  [25],  to  0.8  by  Du  and  Hwang  [39]  and  to  0.824  by  Chung 
and  Graham  [24].  This  thirst  for  proving  the  conjecture  has  been  quenched  recently  by  Du 
and  Hwang  [42]  when  they  proved  the  conjecture  to  be  true  for  all  A.  In  their  proof,  they 
identify  critical  structures  of  full  topology,  and  they  show  that  to  verify  the  conjecture  in 
general,  it  suffices  to  verify  it  for  these  critical  structures. 

We  have  discussed  so  far  the  SPE.  For  higher- dimension  Euclidean  spaces,  Gilbert  and 
PoUak  [66]  showed  that  most  of  the  basic  properties  of  Steiner  minimal  trees,  including  the 
120-degree  and  degree-3  conditions  on  5-points,  stiU  hold.  Also,  they  conjectured  that  the 
lower  bound  on  the  performance  ratio  p is  the  same  for  any  dimensional  Euclidean  space. 
However,  Du  [37]  has  proven  recently  that  this  conjecture  does  not  hold  for  Euclidean  spaces 
of  dimension  higher  than  two.  In  the  next  section,  a related  problem  to  the  SPE,  caUed 
the  rectilinear  distance  Steiner  problem,  is  discussed.  The  basic  difference  between  the  two 
problems  is  in  their  distance  metrics. 
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Figure  1.2.  An  RSP  grid  graph  corresponding  to  the  given  points  marked  by  filled  circles. 


1.2  The  Rectilinear  Steiner  Problem 


Given  N points  in  the  plane,  the  rectilinear  Steiner  problem  (RSP)  consists  of  finding  a 
connected  network  that  has  the  least  total  length  where  the  distance  metric  is  the  rectilinear 
or  Manhattan  metric,  i.e.,  the  distance  from  a to  6 is  \xa  — Xh\  + \ya  — Vbl-  Hanan  [71] 
was  the  first  to  investigate  the  problem.  In  his  paper,  he  proves  that  there  is  always  an 
optimal  connected  network  using  edges  in  the  grid  graph  obtained  by  extending  vertical 
and  horizontal  lines  through  every  given  point  (see  Figure  1.2)  and  taking  the  intersection 
of  those  lines  to  be  the  set  of  nodes  in  the  graph;  furthermore,  there  is  an  edge  or  arc 
between  two  nodes  if  they  are  directly  connected  by  a line,  horizontally  or  vertically.  With 
this  important  result,  the  RSP  reduces  to  a special  case  of  the  Steiner  problem  in  graphs; 
see  section  1.3. 

The  RSP  decision  problem  is  NP-complete  [61].  Although  any  exact  algorithm  for  the 
Steiner  problem  in  graphs  (see  section  1.3)  applies  to  the  RSP,  the  only  known  exact  algo- 
rithm for  the  RSP  is  proposed  by  Yang  and  Wing  [177].  This  algorithm  is  a straightforward 
branch-and-bound  approach,  and  it  seems  to  be  apphcable  only  for  N < 10.  For  the  case 
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when  the  topology  is  fixed,  an  0{N)  time  dynamic  programming  algorithm  is  proposed  by 
Sankoff  and  Rousseau  [134].  Also,  of  interest  are  cases  when  all  given  points  lie  on  the 
boundary  of  a grid  graph  and  when  all  given  points  lie  on  the  boundary  of  their  rectilin- 
ear convex  hull.  For  these  cases,  polynomial-time  algorithms  are  given  by  Aho  et  al.  [2], 
Agarwal  and  Shing  [1],  Bern  [12,  13],  Richards  and  Salowe  [129],  and  Proven  [122].  Explor- 
ing another  direction,  Thomborson  et  al.  [151]  propose  a probabihstic  approach  where  an 
jSfO(VN)  algorithm  computes  the  optimal  RSP  solution  with  high  probability,  assuming 
that  the  given  points  are  uniformly  distributed  in  the  unit  square. 

For  practical  use,  heuristic  methods  are  needed  to  deal  with  the  RSP.  Most  heuristics 
for  approximating  the  Steiner  minimum  tree  (SMT)  for  the  RSP  are  based  on  greedy  ap- 
proaches. Certainly,  a very  simple  one  among  such  heuristics  is  the  minimum  spanning  tree 
(MST)  heuristic.  This  heuristic  requires  O(A^)  time.  Also,  Hwang  [74]  proved  that  the 
performance  ratio  of  the  MST  heuristic,  which  is  the  ratio  of  the  lengths  of  the  SMT  and 
the  MST,  is  at  least  2/3,  where  the  bound  is  tight.  Other  greedy  heuristics  are  given  by 
Bern  and  Carvalho  [14]  and  Hwang  [76].  On  the  average,  these  heuristics  have  a range  of 
improvement  over  the  minimum  spanning  tree  heuristic  from  4 to  9 percent;  however,  none 
of  them  has  a proven  improvement  in  the  worst-case  performance. 

Along  another  heuristic  hne,  a greedy  approach  using  information  from  collected  statis- 
tics is  proposed  by  Jiang  and  Tang  [83],  and  a probabihstic  c- approximation,  a concept  seen 
in  section  1.1  with  Karp’s  partitioning  algorithm,  is  given  by  Komlos  and  Shing  [89].  In 
addition,  any  heuristic  that  is  designed  for  the  Steiner  problem  in  graphs  could  apply  for 
the  RSP  since  the  grid  graph  corresponding  to  an  RSP  is  only  a special  case  of  graphs. 
In  the  next  section,  we  discuss  a class  of  Steiner  problems,  namely  the  Steiner  problem  in 
graphs,  in  which  the  RSP  constitutes  an  instance. 

1.3  The  Steiner  Problem  in  Graphs 

Posed  independently  by  Hakimi  [70]  and  Levin  [96],  the  Steiner  problem  in  graphs  (SPG) 
consists  of  connecting  a subset  of  given  nodes  on  a graph  with  the  minimum  cost  tree. 
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Although  the  SPG  is  considered  as  the  combinatorial  extension  of  the  Steiner  problem  in 
Euclidean  space  (SPE),  the  two  problems  have  httle  in  common,  including  the  buffer  zone 
area  corresponding  to  the  rectihnear  Steiner  problem  discussed  in  the  previous  section. 
Karp  [86,  87]  showed  earher  that  the  SPG  decision  problem  is  NP-complete  in  general. 
Even  for  some  restrictions  hke  grid  graphs  [61],  equal-edge- weight  graphs  (as  claimed  by 
Berlekamp),  bipartite  graphs  [62],  chordal  and  spht  graphs  [166],  the  problem  remains  NP- 
complete.  Moreover,  on  directed  graphs.  Provan  and  Ball  [124]  showed  that  the  variant  of 
the  rooted  Steiner  arborescence  of  a directed  graph  is  NP-complete.  NP-completeness  still 
sticks  to  the  problem  with  planar  acychc  digraphs  with  indegree  and  outdegree  at  most  two 
[121].  However,  there  are  some  instances,  which  we  present  in  the  following  subsections,  for 
which  polynomial  time  algorithms  exist.  In  what  follows,  we  give  a short  survey  of  existing 
algorithms  and  heuristics  for  the  Steiner  problem  in  graphs  for  general  graphs  and  for  some 
special  graphs.  See  Hwang  et  al.  [79],  Voss  [160],  and  MacGregor  Smith  and  Winter  [101] 
for  excellent  sources  of  information  on  the  subject. 

1.3.1  General  Graphs 

AU  known  exact  SPG  algorithms  for  general  graphs  are  in  some  way  enumerative  algo- 
rithms. However,  they  differ  in  how  the  enumeration  is  done  and  how  clever  their  strategies 
are  to  avoid  total  enumeration.  In  order  to  avoid  total  enumeration,  topological  and  other 
properties  of  general  graphs  can  be  used.  Another  alternative  is  to  develop  good  lower  and 
upper  bounds  in  order  to  reduce  the  possible  search  space.  Two  simple  approaches  are 
proposed  by  Hakimi  [70].  While  one  of  them  is  based  on  spanning  tree  enumeration,  the 
other  approach  is  a topology  enumeration  recursive  algorithm.  Another  algorithm,  which 
generates  spanning  trees  for  a derived  problem  in  order  of  increasing  total  length  until  a 
solution  to  the  original  problem  can  be  inferred,  is  proposed  by  Balakrishnan  and  Patel 
[5,  6].  Also,  implicit  enumeration  approaches  are  given  by  Shore  et  al.  [137],  and  Yang  and 
Wing  [177],  where  the  set  of  all  trees  spanning  the  set  of  given  points  is,  in  a systematic 
way,  separated  into  smaller  subsets  to  be  analyzed,  using  upper  and  lower  bounds,  in  order 
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to  determine  whether  or  not  they  contain  the  optimal  feasible  solution.  Along  another  line, 
straightforward  dynamic  programming  approaches  are  proposed  by  Dreyfus  and  Wagner 
[34],  and  Levin  [96].  A branch  and  bound  approach  that  uses  heuristics  to  provide  good 
lower  bounds  and  to  choose  the  next  edge  for  consideration  in  the  backtracking  process,  is 
proposed  by  Foulds  and  Gibbons  [55],  and  Shore  et  al.  [137].  Other  approaches,  including 
branch  and  bound,  are  proposed  by  Beasley  [8,  9],  Aneja  [3],  Maculan  [102],  Lucena  [98], 
Wong  [175],  and  Chopra  et  al.  [21].  Exact  algorithms  for  the  SPG,  however,  can  benefit 
from  preprocessing  the  graph  so  that  the  size  of  the  problem  to  solve  is  reduced.  Reduction 
techniques  are  discussed  in  Beasley  [8],  Balakrishnan  and  Patel  [6],  Duin  and  Volgenant 
[49,  50],  Voss  [158,  159],  Iwainsky  et  al.  [81],  and  Chopra  et  al.  [21].  While  some  of  those 
algorithms  mentioned  above  are  better  then  others,  they  still  run  in  exponential  time  in  the 
worst  case.  To  deal  with  this  intractable  aspect,  researchers  have  developed  heuristics  to 
give  approximate  solutions  for  the  SPG. 

Many  heuristics  have  been  proposed  to  give  approximate  solutions  for  the  SPG.  They 
can  be  classified  into  two  categories.  The  first  type  of  heuristics  use  suboptimal  variants  of 
exact  algorithms.  The  second  type,  on  the  other  hand,  is  based  on  already  known  graph 
algorithms,  for  instance  the  minimum  spanning  tree  (MST)  algorithm.  The  reader  is  referred 
to  Winter  [171],  and  Hwang  and  Richards  [78]  for  surveys.  In  the  first  category,  several 
suboptimal  variations  of  exact  algorithms  have  been  proposed.  For  example,  in  Aneja  [3], 
a greedy  variation  of  the  set  covering  algorithm  is  presented,  and  an  MST-based  routine  is 
used  as  a shortcut  in  the  dual  ascent  algorithm  given  by  Wong  [175].  Based  on  existing  graph 
algorithms,  heuristics  in  the  second  category  are  numerous.  The  first  one  of  such  heuristics 
is  MST-based,  and  it  was  first  proposed  by  El- Arbi  [51]  and  independently  by  Plesnik  [115], 
Kou  et  al.  [93],  Bharath-Kumar  and  Jaffe  [15],  and  Iwainsky  et  al.  [81].  A heuristic  based 
on  Kruskal’s  MST  algorithm  is  given  by  Wu  et  al.  [176].  Another  heuristic  approach  based 
on  Prim’s  MST  algorithm  is  given  by  Takahashi  and  Mutsuyama  [150].  While  all  of  the 
above  approaches  implicitly  choose  the  Steiner  nodes  (optional  nodes  in  the  Steiner  tree). 
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an  average  distance  heuristic  that  selects  those  nodes  is  given  by  Ray  ward- Smith  [127];  refer 
also  to  Rayward-Smith  and  Clare  [128],  and  Foulds  and  Rayward-Smith  [57].  A different 
heuristic  based  on  neighborhood  contraction  techniques  is  provided  by  Plesnik  [115].  All  of 
the  heuristic  approaches  mentioned  above  are  different.  However,  they  have  two  features 
in  common:  They  aU  run  in  strongly  polynomial  time  and  their  performance  ratio,  i.e.,  the 
ratio  of  lengths  of  the  SMT  and  MST,  is  larger  than  one  half.  Although  the  SPG  heuristics 
and  algorithms  surveyed  so  far  are  designed  for  any  type  of  graph,  it  is  sometimes  more 
appropriate  to  design  specific  algorithms  for  specific  types  of  graphs  instead  of  adopting 
general  purpose  algorithms.  In  the  following  material,  some  special  types  of  graphs  along 
with  SPG  algorithms  designed  for  them  are  surveyed  briefiy. 

1.3.2  Partial  A^-Tree  Graphs 

The  class  of  partial  A;-tree  graphs  and  its  various  subclasses  have  many  efficient  algo- 
rithms for  the  SPG  that  run  in  hnear  time  [84,  85].  The  general  case  of  partial  A;- tree 
graphs,  for  a constant  is  treated  in  a very  general  way  by  Bodlaender  [18].  A partial 
k-tree  graph  can  be  characterized  inductively  as  follows: 

(1)  A complete  graph^  on  k vertices  is  a fc-tree  graph. 

(2)  Suppose  that  a graph  is  a k-tree  graph,  and  it  contains  a subset  of  k nodes  that  induces 

a complete  subgraph  in  the  original  graph.  Then,  the  augmented  graph  obtained  by 
adding  a new  node  to  the  original  graph  together  with  edges  from  the  added  node  to 
every  node  in  the  subset  of  k nodes  that  induces  a complete  subgraph  of  the  original 
graph,  is  also  a fc-tree  graph. 

Moreover,  a graph  is  a partial  k-iree  graph  if  there  is  some  /i;-tree  graph  of  which  it  is  a 
subgraph.  Originally,  linear-time  SPG  algorithms  were  estabhshed  for  subclasses  of  k-tree 
graphs,  namely  for  series-parallel  and  Hahn  graphs. 


2 

A complete  graph  is  a graph  where  all  nodes  are  adjacent  pairwise. 
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Series-ParaUel  Graphs 

A graph  is  called  series-parallel  (sp)  (see  figure  1.3a)  if  and  only  if  it  contains  no  subgraph 
homeomorphic  to  1^4,  i.e.,  a complete  graph  on  4 vertices  or  nodes  (see  Figure  1.3b).  Clearly, 

every  sp  graph  is  planar.  An  sp  graph  with  neither  loops  nor  parallel  edges  is  maximal 

* 

series  parallel  (msp)  if  no  edges  between  nonadjacent  vertices  can  be  added  to  it  so  that  it 
remains  an  sp  graph  (see  Figure  1.3c).  An  msp  graph  is  a 2-tree  graph.  Any  msp  graph 
with  n nodes  has  (2n  — 3)  arcs,  and  it  can  be  constructed  from  any  sp  graph  that  has  n 
nodes  in  0{n)  time.  A 0(n)-time  algorithm  to  find  the  SMT  on  msp  graphs  is  proposed 
by  Wald  and  Colbourn  [163],  and  by  Prodon  et  al.  [120]  (with  a shghtly  modified  version). 
As  a result,  the  SMT  on  every  sp  graph  can  be  found  in  0{n)  time  since  one  can  complete 
any  sp  graph  to  an  msp  graph  where  aU  added  edges  are  associated  with  very  large  costs 
in  order  to  insure  that  an  MST  on  the  constructed  msp  graph  is  an  MST  on  the  original 
sp  graph.  A similar  algorithm  for  outerplanar  graphs  is  given  by  Wald  and  Colbourn  [162], 
where  an  outerplanar  graph  is  a graph  that  can  be  embedded  in  the  plane  such  that  aU  its 
nodes  are  on  the  exterior  face;  notice  that  every  outerplanar  graph  is  an  sp  graph. 

Halin  Graphs 

A Halin  graph,  H ==  T U C,  is  a graph  obtained  by  a planar  embedding  of  a tree  T and 
then  adding  arcs  between  the  consecutive  leaves  to  form  a cycle  C (see  Figure  1.4a).  If  T 
is  a star,  i.e.,  it  contains  a node  i which  is  adjacent  to  aU  other  nodes,  then  H is  called  a 
wheel,  and  i is  its  center  (see  Figure  1.4b).  If  there  is  exactly  one  node  j,  among  all  the 
adjacent  nodes  to  i,  that  is  not  in  C,  then  the  subgraph  induced  by  node  i and  its  adjacent 
nodes  excluding  j is  called  a fan  with  center  i.  For  example,  black  nodes  in  Figure  1.4c 
are  centers  of  fans  (surrounded  by  dashed  boxes).  Notice  that  every  Hahn  graph  which  is 
not  a wheel  has  at  least  two  fans.  A hnear-time  algorithm  for  finding  the  Steiner  minimum 
tree  on  Hahn  graphs  that  is  based  on  a technique  cahed  “fan  contraction”  is  presented  by 
Winter  [172]. 


(b)  K4  graph. 


a)  sp  graph 


Figure  1.3.  Graph  instances. 


(a)  A Halin  graph. 


Figure  1.4.  Halin  graphs. 
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1.3.3  Perfect  Graphs 

A graph  is  called  perfect  if  for  any  induced  subgraph  of  it,  the  chromatic  number^  of 
the  induced  subgraph  is  equal  to  its  maximum  chque  size.'^  The  SPG  for  this  class  of 
graphs  is  NP-complete  in  general  [84];  indeed,  it  is  NP-complete  for  special  instances  like 
strictly  chordal  graphs  (even  with  strictly  triangular  weight  functions),  bipartite  graphs, 
comparability  graphs,  split  graphs  and  chordal  graphs.  However,  there  are  instances  in 
this  class  of  graphs,  i.e.,  the  class  of  perfect  graphs,  where  polynomial-time  algorithms  are 
known.  Such  instances  are  permutation  graphs,  distance-hereditary  graphs,  homogeneous 
graphs  and  strongly  chordal  graphs  with  unit  arc  costs. 

Permutation  Graphs 

A permutation  graph  is  a graph  for  which  there  is  a labeling  {ui,  • • • ^Vn}  of  n nodes 
and  a permutation  vector  tt  = (tti,  • • • , tt^)  of  {1,  • • *,  n}  for  which  {i  - j){7Ti  - TTj)  < 0 if 
and  only  if  (vi^vj)  is  an  arc.  A polynomial-time  algorithm  for  the  SPG  on  permutation 
graphs  is  proposed  by  Colbourn  and  Stewart  [29].  A subclass  of  permutation  graphs  called 
cographs  has  also  been  studied.  A graph  is  called  a cograph  if  it  is  a permutation  graph 
and  it  has  no  induced  subgraph  which  is  a 4-node  path.  The  SPG  on  cographs  is  solvable 
in  polynomial  time. 

Distance-Hereditary  Graphs 

A graph  is  said  to  be  a distance-hereditary  graph  if  every  cycle  in  the  graph  with  five 
or  more  nodes  has  two  crossing  chords.  A polynomial-time  algorithm  for  finding  the  SMT 
on  distance- hereditary  graphs  is  given  by  D’Atri  and  Moscarani  [32]. 

Homogeneous  Graphs 

The  subclass  of  homogeneous  graphs  is  an  extension  of  the  subclass  of  distance-hereditary 

graphs.  A homogeneous  set  in  a graph  is  a proper  subset  of  nodes,  which  contains  two  or 

The  chromatic  number  of  a graph  is  the  minimum  number  of  colors  needed  to  color  the  nodes  of  the 
graph  where  no  adjacent  nodes  are  colored  with  the  same  color. 

^The  maximum  clique  size  of  a graph  is  the  cardinality  of  the  largest  subset  of  nodes  which  induces  a 
complete  subgraph  of  the  original  graph. 
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more  nodes,  such  that  every  remaining  node  in  the  graph  is  adjacent  to  either  all  or  none 
of  the  nodes  in  the  subset.  A homogeneous  set  is  said  to  be  maximal  if  there  is  no  homo- 
geneous set  properly  containing  it.  If  the  reduction  of  a connected  graph  with  respect  to  a 
subset  of  its  nodes  is  defined  as  the  graph  obtained  by  replacing  the  subset  of  nodes  with  a 
single  node  having  the  same  neighbors  as  the  replaced  subset  of  nodes,  then  homogeneous 
graphs  are  recursively  defined  in  terms  of  reduction  of  their  two  connected  components 
with  respect  to  maximal  homogeneous  sets.  A polynomial-time  algorithm  for  the  SPG  on 
homogeneous  graphs  is  given  by  D’Atri  et  al.  [33] 

Strongly  Chordal  Graphs  with  Unit  Arc  Costs 

A graph  is  called  chordal  if  every  cycle  with  four  or  more  edges  has  a chord.  A graph  is 
called  strongly  chordal  if  it  is  chordal  and  every  even  cycle  with  six  or  more  edges  or  arcs 
has  an  odd  chord,  i.e.,  a chord  dividing  the  cycle  into  two  paths  containing,  each,  an  odd 
number  of  edges.  A polynomial  time  algorithm  for  the  SPG  on  strongly  chordal  graphs 
with  unit  arc  costs  is  proposed  by  White  et  al.  [166]. 

1.3.4  Planar  Graphs 

A graph  is  called  planar  if  it  can  be  embedded  in  the  plane,  with  arcs  allowed  to  cross 
only  at  nodes.  The  SPG  on  planar  graphs  in  general  is  NP-complete.  Special  cases  of  planar 
graphs  are  grid  graphs  and  A:-planar  graphs. 

Grid  Graphs 

Grid  graphs  are  subgraphs  of  the  two-dimensional  rectiUnear  grid  (see  section  1.2).  As 
mentioned  in  section  1.2,  the  SPG  on  grid  graphs  is  NP-complete.  However,  there  are 
some  instances  w^here  polynomial-time  algorithms  exist.  When  aU  n given  points  he  on  the 
boundary  of  the  grid  graph,  which  has  p rows  and  q columns,  an  0{p^q  + pq^)  or  O(n^) 
time  algorithm,  which  is  improved  to  0{n)  time  if  p = 2,  is  given  by  Aho  et  al.  [2].  Also, 
Agarwal  and  Shing  [1],  and  Cohoon  et  al.  [28]  propose  an  0{p  -f  q)  time  algorithm  for  the 


same  case. 
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An  extension  of  the  above  case  is  to  restrict  the  given  points  to  lie  on  the  boundary 
of  their  rectilinear  convex  hull.  For  this  special  case,  an  algorithm,  which  is  based  on  an 
algorithm  for  1-planar  graphs,  was  proposed  by  Bern  [12]  and  Provan  [122]  independently. 
This  algorithm  runs  in  0{n^)  time.  By  using  an  advanced  data  structure,  the  complexity 
was  reduced  by  Bern  to  O(n^).  Recently,  Bern  [13]  proposed  an  time  algorithm 

for  the  generalized  case  where  h of  the  n given  points  lie  on  the  rectilinear  convex  hull.  For 
the  case  where  the  convex  hull  has  k number  of  “sides,”  an  0{k^n)-time  algorithm  is  given 
by  Richards  and  Salowe  [129]. 

Another  special  case  is  the  case  of  minimal  distance  rectangular  trees  (MDRT).  A rect- 
angular tree  is  a subgraph  where  every  point  is  on  the  outer  boundary  and  adjacent  to  two 
other  boundary  points.  An  MDRT  is  a rectangular  tree  that  spans  the  given  points  and 
preserves  the  length  of  shortest  paths  between  the  given  points  on  the  grid  graph.  Farley  et 
al.  [53]  propose  a linear  time  algorithm  to  find  the  MST  for  a case  restricted  to  an  MDRT. 
Also,  a polynomial-time  algorithm  is  given  by  Winter  [174]  to  decide  if  an  MDRT  exists 
and  to  give  one  if  so. 

/v -Planar  Graphs 

A graph  that  has  a planar  embedding  with  ail  (or  all  but  one)  of  the  given  nodes,  i.e., 
the  nodes  to  be  connected,  on  the  boundary  of  at  most  k faces  is  called  A;-planar  (or  nearly 
fc-planar),  respectively.  A polynomial-time  algorithm  to  find  such  an  embedding,  for  a fixed 
/?,  is  presented  by  Bienstock  and  Monma  [16].  By  modifying  the  dynamic  programming 
algorithm  proposed  by  Dreyfus  and  Wagner  [34],  the  SPG  for  those  special  cases,  fc-planar 
and  nearly  fc-planar  graphs,  can  be  found  in  polynomial  time  given  k [52,  13].  Those  results 
are  generalized  by  considering  A:-planarity  as  a convexity  property  [122],  and  a polynomial- 
time algorithm  to  recognize  some  of  these  convexity  properties  is  given  by  Bienstock  and 
Monma  [17]. 
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1.4  Applications  and  Generalizations  of  the  Steiner  Problem 

The  Steiner  problem  has  applications  in  many  areas.  In  this  section,  we  present  a survey 
of  some  of  these  applications.  Also,  we  present  some  known  generalized  forms  of  the  Steiner 
problem  which  arise  in  real  world  situations. 

1.4.1  Steiner  Problem  in  Manufacturing 

The  Steiner  problem  has  a number  of  applications  in  manufacturing  systems.  Those 
apphcations  range  from  specific  applications  fike  the  construction  of  printed  circuit  boards 
in  very  large  scale  integration  (VLSI)  automated  manufacturing  systems  to  plant  and  cell 
layout  in  fiexible  manufacturing  systems.  This  variety  of  applications  in  manufacturing  is 
due  to  the  Steiner  problem  being  as  a special  case  of  the  network  design  problem  [104], 
an  area  that  is  strongly  tied  to  automated  and  fiexible  manufacturing  systems.  For  more 
details,  the  reader  is  referred  to  Winter  [171],  Hwang  [75],  Korte  [91],  Rose  et  al.  [130],  and 
Hanan  [72]. 

1.4.2  Steiner  Problem  in  Phvlogenv 

An  interesting  theoretical  apphcation  for  the  Steiner  problem  is  in  relation  to  the  theory 
of  evolution  due  to  Charles  Darwin.  In  his  theory,  Darwin  claims  that  existing  biological 
species  have  been  linked  in  the  past  by  common  ancestors.  A tree  representing  ancestor 
relationships  is  called  a phylogenetic  tree,  and  it  can  be  constructed  by  examining  differences 
in  DNA  codes.  First,  a complete  graph  is  constructed  where  every  node  corresponds  to 
a different  DNA  code,  which  in  turn  corresponds  to  a different  species,  and  the  weight 
associated  with  an  edge  between  any  two  nodes  is  proportional  to  the  number  of  differences 
in  corresponding  positions  in  the  strings  of  their  DNA  codes.  On  this  graph,  connecting  a 
subset  of  nodes  with  the  Steiner  minimum  tree  corresponds  to  finding  a phylogenetic  tree 
for  the  corresponding  species.  For  more  information  on  the  topic,  see  Hwang  and  Richards 
[78],  Winter  [171],  and  Foulds  and  Graham  [56]. 
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1.4.3  Generalized  Steiner  Problems 

In  real-world  applications,  some  problems  requiring  the  design  of  some  optimal  con- 
nection network  are  encountered.  Those  problems  are  different  from  the  forms  of  Steiner 
problems  discussed  so  far  by  the  connectivity  requirements  and  (or)  by  the  cost  measure- 
ment, corresponding  to  the  connecting  network,  to  optimize.  Therefore,  we  consider  those 
problems  as  generalized  forms  of  the  Steiner  problem.  In  what  follows,  we  present  a set  of 
generahzed  Steiner  problems  that  are  related  to  real-world  applications.  For  more  detailed 
surveys,  the  reader  is  referred  to  Voss  [157],  and  Hwang  and  Richards  [78]. 

GSFl:  Steiner  Problem  for  Object  and  Obstacle  Sets 

This  problem  consists  of  connecting  a set  of  objects  in  a space  containing  a set  of 
obstacles  with  the  minimum  cost  interconnecting  network.  We  encounter  such  problems  in 
pipehne  layout  or  routing  heating  mains;  refer  to  Smith  and  Liebman  [141],  Liestman  [97], 
Smith  [138],  Provan  [123],  and  Smith  and  Winter  [142]  for  detailed  analysis  and  special  case 
studies. 

GSP2:  Steiner  Problem  for  an  Arbitrary  Metric  Space 

This  type  of  generahzation  arises  from  industrial  problems  and  in  modeling  drainage 
networks  [73].  For  the  interested  reader,  more  detailed  material  related  to  the  topic,  al- 
gorithm development  and  special  case  studies,  can  be  found  in  Soukup  [145],  Cheung  and 
Soukup  [20],  and  Sankoff  and  Rousseau  [134]. 

GSP3:  Gilbert’s  Flow- Dependent  Networks 

Gilbert  [65]  studied  the  shortest  interconnecting  network  that  must  satisfy  a given  matrix 
of  two-way  flows  where  the  cost  of  an  edge  per  mile  is  a function  of  the  edge  capacity.  For 
more  details,  refer  to  van  de  Heyden  [155],  Trietsh  and  Handler  [153],  Du  and  Hwang  [40], 
and  Werner  [165]. 
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GSP4:  Multiphase  Steiner  Networks 

Suppose  we  are  given  an  edge- weighted  graph  with  m pairs  of  vertex  subsets,  where  the 
two  subsets  in  each  pair  are  disjoint.  The  multiphase  Steiner  problem  consists  of  connecting 
for  every  pair  of  vertex  subsets  the  vertices  in  the  first  subset,  via  vertices  that  are  exclusively 
in  the  second  subset,  its  twin  subset,  with  minimum  total  cost  [48].  This  type  of  problem 
arises  in  communication  network  design  [118,  119]  and  in  placement  of  valves  in  vacuum 
systems  [38,  35].  A related  form  of  generalization  is  presented  by  Voss  [157].  For  more 
information  on  the  subject,  the  reader  is  referred  to  Du  and  Miller  [47],  and  Du  and  Pardalos 
[48]. 

GSP5:  Multiconnected  Steiner  Networks 

When  designing  water  or  electricity  supply  networks,  communication  networks  or  other 
types  of  networks,  two  competing  goals  are  often  present:  minimization  of  network  total 
cost  and  maximization  of  reliability.  Reliability  is  expressed  by  the  degree  of  connectivity 
of  the  network,  i.e.,  the  number  of  edge  disjoint  paths  connecting  every  pair  of  given  nodes. 

Given  an  edge-weighted  undirected  graph,  a subset  of  regular  nodes  and  minimum- 
degree-of-connectivity  requirements  between  aU  pairs  of  regular  nodes  (i.e.,  the  minimum 
required  number  of  edge-disjoint  paths  connecting  a pair  of  nodes  in  any  interconnection 
network),  the  multiconnected  Steiner  network  problem  consists  of  finding  the  minimum  cost 
network  that  meets  the  minimum-degree-of-connectivity  requirements  between  the  regular 
nodes  [171,  157].  The  first  to  mention  this  type  of  Steiner  problem  was  Krarup  who,  in 
an  unpublished  1978  memo,  suggested  finding  the  minimum  weight  2-path-connected  (or  2- 
edge-connected)  subgraph  that  spans  a set  of  given  points.  The  reader  is  referred  to  Winter 
[169,  170,  172],  and  Voss  and  Mehr  [161]  for  special  case  studies. 

GSP6:  Degree  Constrained  Steiner  Problem 

This  generalized  form  arises  from  real-world  problems  like  water  supply  systems  or 
optimal  design  of  communication  networks  where  there  are  constraints  on  the  capacity  of 
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nodes.  For  a given  undirected  weighted  graph  and  a regular  subset  of  nodes,  the  problem  is 
to  connect  the  given  regular  subset  of  nodes  with  the  minimum  weight  subgraph  such  that 
there  is  an  upper  bound  for  the  degree  of  incidence  to  every  regular  node  in  the  subgraph, 
see  Voss  [157,  156]  for  some  analysis  and  a branch  and  bound  algorithm. 

GSP7:  Steiner  Problem  in  Directed  Graphs 

This  is  the  most  obvious  generalization  of  the  Steiner  problem.  As  we  will  see  in  chap- 
ter 2,  to  every  undirected  graph,  there  is  a corresponding  directed  graph  obtained  by  cre- 
ating for  every  undirected  arc  two  oppositely  directed  arcs  associated  with  the  same  cost  of 
the  undirected  arc.  Note  that  the  uncapacitated  plant  location  problem  [175]  and  the  set 
covering  problem  [3]  are  special  cases  of  this  generalized  Steiner  problem. 

GSP8:  Node  Weighted  Steiner  Problem 

Given  a node-and-edge- weighted  graph  and  a regular  subset  of  nodes,  the  node- weighted 
Steiner  tree  problem  consists  of  connecting  the  given  regular  subset  of  nodes  with  a subgraph 
that  minimizes  the  total  sum  of  costs  corresponding  to  its  nodes  and  edges.  This  form  of 
Steiner  problem  is  motivated  by  a model  of  distributed  data  processing  [135].  For  further 
analysis  and  practical  realizations  the  reader  is  referred  to  Duin  and  Volgenant  [49],  and 
Iwainsky  et  al.  [81]. 

GSP9:  Network  Design  Problem 

Given  a graph,  the  problem  is  to  find  a subgraph  that  minimizes  the  sum  of  fixed  costs  of 
the  arcs  in  the  subgraph  and  the  costs  for  routing  goods  through  this  subgraph.  This  prob- 
lem can  be  shown  to  be  a special  network  design  problem  [104].  Numerous  transportation 
applications,  ranging  from  water  resource  planning  to  problems  in  flexible  and  automated 
manufacturing  systems,  can  be  represented  under  this  form. 

GSPIO:  Steiner  Problem  with  Basic  Sets 

Given  an  undirected  weighted  graph,  and  a number  of  regular  subsets  of  given  nodes, 
the  problem  is  to  find  the  minimum  cost  connected  subgraph  such  that  the  vertex  set  of 
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this  subgraph  contains  at  least  one  node  of  each  given  regular  subset  of  nodes.  This  discrete 
version  of  the  problem  is  of  importance  in  VLSI  design  [157]. 

GSPll:  Steiner  Forest  Problem 

Given  an  undirected  edge- weighted  graph,  a regular  subset  of  nodes  and  a positive 
integer  the  problem  consists  of  finding  a subgraph  with  minimum  total  cost  such  that 
this  graph  has  at  most  k connected  components  each  containing  at  least  one  regular  node. 
This  problem  arises  in  the  oil  industry  with  relation  to  exploiting  an  offshore  area,  where 
more  than  one  terminal  might  exist  for  pumping  the  oil  ashore  [49],  a problem  whose 
solutions  are  not  necessarily  connected. 

1.5  Objectives  and  Contributions 

In  the  previous  sections,  we  have  given  a brief  survey  of  the  Steiner  problem  with  its 
different  forms,  in  terms  of  existing  algorithms  and  heuristics,  special  case  studies  and 
real-world  apphcations.  However,  the  scope  of  the  survey  is  too  large  as  seen  from  the 
perspective  of  the  study  undertaken  in  this  work.  In  our  Ph.D.  work,  we  address  a specific 
area  of  the  Steiner  problem,  namely  the  Steiner  problem  in  graphs  or  SPG.  Indeed,  we 
propose  three  new  formulations  for  the  problem,  a heuristic  to  give  approximate  solutions 
to  the  problem,  a branch-and-bound  exact  algorithm,  and  the  first  test-problem  generator. 

One  of  the  proposed  formulations  in  chapter  2,  an  integer  formulation,  is  used  to  develop 
lower  bounds  in  chapter  4 for  an  exact  branch-and-bound  scheme,  and  to  devise  the  first 
test-problem  generator  for  the  SPG  in  chapter  5.  Providing  a starting  solution  for  the 
branch-and-bound  algorithm,  a heuristic  based  on  assigning  preference  weights  to  arcs  is 
proposed  in  chapter  3. 

The  contribution  due  to  this  study  is  established  on  the  bases  of  original  ideas  and 
encouraging  experimental  results.  However,  as  concluded  in  chapter  6,  extensions  of  our 
established  results  create  new  avenues  to  be  explored  in  future  research. 


CHAPTER  2 


EQUIVALENT  FORMULATIONS  FOR  THE  STEINER  PROBLEM  IN  GRAPHS 

2.1  Introduction 

Defining  a problem  is  a crucial  step  toward  solving  it.  An  ill  defined  problem  is  certainly 
not  ready  to  be  tackled.  While  defining  or  formulating  a problem  requires  a scientific 
background,  it  is  in  major  part  an  art.  In  this  chapter,  we  present  in  the  third  section 
five  known  and  three  new  mathematical  formulations  for  the  Steiner  problem  on  undirected 
graphs  (SPG)  and  the  Steiner  problem  on  directed  graphs  (SPDG).  These  formulations  are 
of  three  different  types:  mixed  integer  formulations,  integer  formulations  and  continuous 
formulations.  In  the  second  section,  the  relationship  between  the  SPG  and  the  SPDG  is 
explored,  and  we  give  conclusions  in  section  2.4. 

2.2  The  Steiner  Problem  in  Directed  and  Undirected  Graphs 

Two  closely  related  problems  are  the  SPG  and  the  SPDG.  In  this  section,  we  establish 
this  relationship  after  defining  the  two  problems. 

2.2.1  Instance  Formulation  of  the  Steiner  Problem  in  Undirected  Graphs 

Here,  we  give  a definition  for  the  SPG  which  has  an  interesting  matroid  interpretation 
[94,  67].  Let  G = (Af^A^C)  be  an  undirected  graph,  where 
Af  = {1, . . . , n}  is  a set  of  nodes,  and 

is  a set  of  undirected  arcs  (i,  j)  with  each  arc  incident  to  two  nodes,  and 
C is  a set  of  nonnegative  costs  Cij  associated  with  undirected  arcs 
Then,  the  SPG  is  defined  as  follows. 
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Instance:  A graph  G = (A/",  *4,C),  a node  subset  TZ  G A/". 

Question:  Find  the  minimum  cost  tree,  on  G,  that  would  connect  all  the  vertices  in  TZ, 

2.2.2  Instance  Formulation  of  the  Steiner  Problem  in  Directed  Graphs 

The  SPDG  is  the  directed  graph  version  of  the  Steiner  problem.  Apparently,  Nastansky 
et  al.  [108]  are  the  first  researchers  to  consider  the  SPDG.  Consider  where 

Af  = {1, . . . , n}  is  a set  of  nodes,  and 

is  a set  of  directed  arcs  (i,  j)  incident  to  nodes  in  Af. 

= {cfj}  is  a set  of  costs  associated  with  A^. 

Furthermore,  let  7Z  (a  set  of  regular  nodes)  be  a subset  of  A/*,  and  let  r (a  root  node)  be  an 
arbitrary  node  in  TZ.  Also,  define  a directed  Steiner  tree  T rooted  at  node  r,  with  respect  to 
TZ  on  G,  to  be  a directed  tree  where  all  nodes  in  TZ\{r}  can  be  reached  from  r via  directed 
paths  in  T,  and  aU  leaf  nodes  are  in  TZ.  Next,  we  define  the  SPDG. 

Instance:  A graph  G^  = (A/",  a node  subset  TZ  and  a root  node  r in  TZ. 

Question:  Find  the  minimum  cost  directed  Steiner  tree,  on  G with  respect  to  7^,  that  has 
node  r as  root  node. 

2.2.3  A Generahzation  Relationship 

In  the  fight  of  the  SPG  and  SPDG  definitions  given  above,  it  is  not  hard  for  one  to  see 
that  the  SPDG  is  a generalization  of  the  SPG.  In  fact,  the  SPG  can  be  transformed  to  a spe- 
cial case  of  the  SPDG  where  the  arc  cost  structure  is  symmetric.  This  transformation  can 
be  done  as  follows.  Given  a graph  G = (A/^,  ^,C),  create  the  corresponding  directed  graph 
G^  = (A/*,  w4^,C^),  where  every  undirected  arc  (^,J)  G A corresponds  to  two  oppositely 
directed  arcs  (i,j)  and  (j,  i)  G A^  associated  with  the  same  cost  {cj-  — cj-  — Cij).  Po- 
tentially, any  mathematical  formulation  for  the  SPDG  can  be  used  to  formulate  the  SPG. 
Having  highlighted  this  relationship,  we  can  proceed  to  the  next  section  where  different 
mathematical  formulations  are  given  for  the  SPDG  and  the  SPG. 
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2.3  Mathematical  Formulations 

In  this  section,  we  consider  different  mathematical  formulations  for  the  SPDG  and  the 
SPG;  as  mentioned  above,  all  of  those  formulations  are  potentially  for  undirected  graphs 
since  every  undirected  graph  is  associated  with  a symmetric  directed  graph.  The  different 
mathematical  formulations  considered  here  can  be  divided  into  three  categories:  mixed 
integer  formulations,  integer  formulations  and  continuous  formulations.  Whenever  used  in 
the  following  sections  and  chapters,  0{i)  and  I(i)  denote  the  subsets  of  arcs  that  are  going 
out  of  node  i and  into  node  i respectively. 

2.3.1  Mixed  Integer  Formulations 

One  of  the  classes  of  mathematical  models  used  to  formulate  the  SPG  is  mixed  integer 
programming,  where  some  of  the  variables  used  in  the  model  are  integer,  and  others  are 
continuous.  Two  known  mixed  integer  formulations  based  on  network  flow  requirements 
are  given  here. 

Network  Formulation  1 

Given  a directed  graph  and  a subset  of  nodes  7^,  consider  a network 

flow  problem  in  which  the  root  node  r £ TZ  offers  \TZ\  — 1 units  of  flow' and  each  node  in 
TZ\{r}  demands  one  unit.  Let  yij  be  the  amount  of  flow  on  arc  (i,  j),  and  Xij  be  a binary 
variable  corresponding  to  arc  (^,  j).  Then,  consider  the  following  formulation. 

{^i)  minimize  ^ CijXij 

subject  to:  VH  ~ 2/ji  = -1 

{hj)eo(i)  (j,i)ei{i) 

yij  ~ Vji  — 1^1 — 1 

(bi)eo(t)  (i,t)ex(0 

L/  y^j  ~ Xy  yj'  ~ ® 

(i,j)eo(i)  U,i)ei(i) 

0 < Vij  < (|7^|  - 

Xij  € {0, 1} 


(Vi  G n\{r}) 

(for  i — r) 

(Vi  G Af\n) 
(V(i,i)€^'') 
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The  constraints  involving  variables  yij  only  are  simply  flow  conservation  constraints.  The 
coupling  constraints  enforce  the  fact  that  an  arc  can  carry  a positive  flow  only  if  it  is  used 
in  the  tree,  meaning  that  its  corresponding  binary  variable  Xij  is  1. 

There  are  formulations  of  the  same  type  used  in  location  problems  and  in  the  travehng 
salesman  problem  [63,  110].  This  formulation  for  the  SPDG  is  presented  by  Arpin  et  al.  [4] 
where  relaxation  methods  are  apphed  to  it. 

Network  Formulation  2 


Given  a directed  graph  and  a subset  of  nodes  7^,  consider  a multicom- 

modity flow  problem  in  which  the  root  node  r £ TZ  offers  one  unit  of  every  commodity  k 
(/t  = 1 • • • \TZ\  — 1),  where  commodities  correspond  to  nodes  in  TZ\{r}.  Let  y^-  be  the  amount 

4/ 

of  commodity  k (the  amount  of  flow  between  nodes  r and  k)  on  arc  (i,  j),  and  let  Xij  be  a 
binary  variable  that  indicates  whether  or  not  an  arc  (i,  j)  is  included  in  the  solution.  Then, 
the  SPDG  second  formulation  is  given  below. 


{T2)  minimize  ^ cijxij 

subject  to:  ^ Vji 

(iJ)eO(i)  (j,i)el(i) 

E Vij-  E y'i 

E 4-  E 4 

ihj)^0(i)  (j,i)£l(i) 

0 < y-j  < Xij 

Xij  G {0, 1} 


1 (for  i = r,  V/s  G 7^\{r}) 

— 1 {foT  i = k,'ik  £ TZ\{r}) 

0 (for  i ^ {r, /;},  VA;  G 7^\{r}) 

(V(i,i)G^‘') 


The  coupling  constraints  indicate  that  flow  of  any  commodity  is  allowed  on  arc  (i,j)  only 
if  that  arc  is  included  in  the  solution.  In  a way,  we  can  view  X{j  as  the  flow  capacity  for 
commodity  k on  arc  The  first  three  constraints,  involving  y^j  only,  dictate  that  one 

unit  of  commodity  k be  routed  between  nodes  r and  k. 

This  formulation  is  given  by  Wong  [175]  where  a dual  ascent  approach  is  applied  to  it. 
Note  that  formulation  T2  can  be  reduced  in  size  by  aggregating  the  couphng  constraints 
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[175].  However,  a number  of  studies  of  related  problems  such  as  plant  location  and  fixed 
charge  network  design  [30,  103,  125, 126]  have  concluded  that  the  disaggregated  formulation, 
although  it  is  larger  in  size,  is  much  more  useful  for  developing  efficient  algorithms. 

2.3.2  Integer  Formulations 

The  SPG  can  be  approached  as  an  instance  in  the  realm  of  integer  programming.  In 
integer  programming,  formulating  a problem  is  crucially  important  to  solving  it  [109].  In 
this  section,  we  present  three  known  and  two  new  integer  formulations  for  the  SPG  and  the 
SPDG.  Some  of  these  formulations  have  been  approached  with  relaxation  methods  used  to 
develop  lower  bounds  for  a branch  and  bound  scheme  in  some  previous  studies. 

An  Elementary  Path  Formulation 

Given  an  undirected  graph  G = (A/*,  A,C),  a subset  of  regular  nodes  TZ  and  an  arbitrary 
node  r G 7^,  associate  with  every  arc  (i,i)  two  binary  variables  Xij  and  yijk]  X{j  is  one  if 
arc  (i,j)  is  used  in  the  solution  tree,  and  zero  otherwise;  yij^  is  one  if  arc  (i,j)  is  on  the 
unique  elementary  path  in  the  solution  tree  from  node  r to  node  k {k  e and  zero 

otherwise.  The  SPG  is  formulated  as  follows. 

(^3)  minimize  ^ CijXij 

(iJ)eA 

subject  to:  there  exists  an  elementary  path  [yijk]  from  r to  A;  ,VA:  G TZ\{r} 

Xij  > Vijk  (Vfc  G TZ\{r},\/{iJ)  G A) 

e{o,i} 

Vijk  e {0,1}  (VA:G7^\W,V(^,i)G^). 

The  first  constraint  ensures  that  the  solution  is  connected.  The  second  constraint  dictates 
that  an  arc  is  on  some  path  only  if  that  arc  is  used  in  the  tree.  This  formulation  is  given  by 
Beasley  [8]  where  a Lagrangean  relaxation  technique  is  used  to  develop  lower  bounds  for  a 
branch  and  bound  algorithm.  In  this  approach,  redundant  “box  constraints”  are  added  to 
the  formulation  to  strengthen  lower  bounds. 
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A Minimum  Spanning  Tree  Formulation 

This  formulation  is  given  by  Beasley  [9]  where  a branch  and  bound  algorithm  is  devised 
using  lower  bounds  obtained  from  Lagrangean  relaxation  techniques.  Given  an  undirected 
graph  G = (AT,  a subset  of  nodes  TZ  and  an  arbitrary  node  r G 7^,  the  formulation  is 

motivated  by  the  following.  The  SPG  is  to  choose  a subset  of  A that  would  connect  together 
all  nodes  in  TZ  with  the  minimum  cost.  It  is  well  known  that  edges  in  this  subset  of  A will 
form  a minimum  cost  spanning  tree,  the  Steiner  tree,  on  some  subset  of  vertices  7^U<S,  where 
<5  C M\TZ  is  called  the  set  of  Steiner  vertices.  In  order  to  exploit  this  valuable  information, 
let  us  add  an  artificial  node  0 to  the  graph.  Also,  for  each  node  i G N\TZ^  add  an  edge  (0,  i) 
of  cost  zero.  For  node  r,  add  an  edge  (0,  r)  of  cost  zero.  Clearly,  the  SPG  reduces  to  finding 
the  minimum  spanning  tree  (MST)  on  the  augmented  graph  with  the 

additional  constraint  that  in  this  MST,  any  node  i G M\TZ  connected  by  the  arc  (0,  i)  to 
vertex  0 must  have  degree  1 (see  Figure  2.1).  In  more  rigorous  mathematical  terms,  this 
can  expressed  by  the  following  formulation. 

(.F4)  minimize  ^ CijXij 

subject  to:  [^ij]  = spanning  tree  on  G^  = (A/^,w4^,C^) 

XQi  + Xpq  < 1 (for  p OT  q = i,  \/i  G Af^\TZ) 

A Set  Covering  Formulation 

Given  a directed  graph  G^  = (Af^A^^C^)  and  a subset  of  nodes  7^,  consider  the  partition 
oiAf  = XU  X such  that  A n A 0,  7^  H A / 0 and  7^  n A ^ 0.  LetV=^{X,X)  denote 
the  cut-set  of  edges  between  X and  X.  Obviously,  there  are  exponentially  many  such  sets. 
Suppose  that  we  enumerate  such  sets  to  be  T^i,  • • •,T^7r-  Define  constants  apij  (p  G {1  • • -tt}, 

(i,j)  G A^)  to  be  1 if  arc  (i,  j)  is  in  cut  set  Pp,  and  zero  otherwise.  Now,  the  SPDG  can  be 
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Isolated  nodes  connected  Subtree  connected  to  r 

to  0 by  edges  of  cost  0 

Figure  2.1.  A solution  to  the  restricted  MST  problem, 
formulated  as  the  following  set-covering  problem: 

minimize  ^ CijXij 

(iJ)eAd 

subject  to:  apijXij  > 1 (VpG 

(ij)eA^ 

This  formulation  is  given  by  Aneja  [3]  where  a set  covering  algorithm  for  the  SPG  is  pro- 
posed. In  addition,  Chopra  et  al.  [21]  have  used  this  formulation  to  find  lower  bounds, 
useful  for  a branch  and  bound  algorithm,  by  solving  the  relaxed  linear  problem.  The  re- 
laxed version  has  potentially  an  exponential  number  of  constraints;  however,  one  need  not 
consider  aU  of  them,  and  this  can  be  done  using  a row  generation  technique  [3]. 

A Rooted  Tree  Formulation 

In  the  following,  a rooted-tree  formulation,  which  is  used  in  chapter  4 to  develop  lower 
bounds  for  a branch-and-bound  algorithm  and  in  chapter  5 to  provide  optimality  conditions 
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for  a test-problem  generator,  is  presented.  Given  a directed  graph  = (A/^,  a 

subset  of  nodes  TZ  and  a root  node  r £ TZ^  the  SPDG  can  be  formulated  as  follows. 

minimize  ^ CijXij 

{i,j)eA^ 

subject  to:  E = 1 w e K\{r)) 

ihj)ei{j) 

^ Xki  >Xij  e A'^\0{r)) 

[xij]  contains  no  cycles. 

Whether  or  not  an  arc  (i,  j)  is  used  in  the  tree  is  translated  to  assigning  the  value  1 or  0 to 
its  corresponding  binary  variable  Xij.  An  aggregated  form  of  the  above  formulation,  where 
aggregation  is  performed  on  the  second  set  of  constraints,  is  given  by  Nastansky  et  al.  [108]. 
The  first  type  of  constraint  implies  that  at  least  one  arc  is  going  into  every  regular  node  i, 
i G ^\{^}.  The  second  type  of  constraint  dictates  that  for  a directed  arc  (i,  j)  to  be  used 
in  a tree,  at  least  one  tree  arc  has  to  be  going  into  node  z,  i £ A^\{r}.  This  constraint 
and  the  last  constraint  together  imply  the  structure  of  a rooted  tree.  Note  that  there  are 
different  ways  to  represent  the  last  constraint  mathematically.  This  type  of  constraint  is 
encountered  in  formulating  the  traveling  salesman  problem  where  cycles  corresponding  to 
subtours  are  to  be  eliminated  [95]. 

Although  this  formulation  does  not  involve  flow  variables,  it  has  the  feature  of  rooted 
trees  in  common  with  the  network  flow  formulations  given  earlier.  This  combination  of  the 
integer  and  rooted-tree  features  gives  the  formulation  a unique  ingredient.  Also,  note  that 
this  formulation  does  not  involve  potentially  an  exponential  number  of  constraints  like 

A Node- Arc  Integer  Formulation  with  Valid  Cuts 

Here,  we  present  a new  formulation  motivated  by  the  fact  that  we  can  see  an  undirected 
Steiner  tree  on  an  undirected  graph  as  a directed  tree,  on  the  corresponding  directed  graph, 
rooted  at  any  regular  node  (we  have  \TZ\  such  directed  trees).  Given  an  undirected  graph 
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G = (jV,  ^,C),  its  corresponding  symmetric  directed  graph  and  a subset 

of  regular  nodes  TZ,  associate  with  every  undirected  arc  {i,j)  in  G a binary  variable  Xij,  and 

associate  with  every  directed  arc  {i,j)  in  binary  variables  r/^-;  Xij  is  one  if  undirected 

•/ 

arc  (i,  j)  is  used  in  the  solution  tree,  and  zero  otherwise;  yfj  is  one  if  directed  arc  (i,j)  is 
in  a tree  rooted  at  G 7^,  and  zero  otherwise.  Also,  with  every  node  i G Af\TZ^  associate  a 
binary  variable  Si  that  is  one  if  node  i is  in  the  solution  tree,  and  zero  otherwise.  The  SPG 
is  formulated  as  follows. 


(^r) 


• • 


minimize 


E 


^ij  ^ij 


subject  to:  ^ Xij 

> 

1 

(Vz  € 7^) 

> 

2Si 

(Vz  6 V\7^) 

Si 

> 

Xij 

yfj  + Vji 

— 

Xij 

{yken,y{i,j)eA^) 

E Vij 

— 

1 

(VA:  e 7^,Vi  € 7^\{fc}) 

iG)ei(j) 

E 4 

— 

(yk  e 7^,  Vi  € V\7^) 

Vik 

— 

0 

(VA  e n,y{i,k)ei(k)) 

[y^j]  contains  no  cycles 

(VA  e n). 

The  first  three  constraints  ensure  that  in  the  undirected  tree,  every  regular  node  is  connected 
to  the  tree,  and  that  every  Steiner  node  has  at  least  a degree  of  two.  The  fourth  set  of 
constraints  enforce  the  fact  that  every  rooted  tree  solution  is  derived  from  the  unrooted 
tree  by  appropriately  assigning  directions  to  arcs  in  the  undirected  tree  and  vice  versa.  The 
fifth,  sixth,  seventh  and  eighth  sets  of  constraints  ensure  that  the  directed  interconnecting 
network  corresponding  to  node  A;  G is  a directed  tree  rooted  at  k connecting  its  root 
node  to  all  other  regular  nodes  in  7^\{fc},  a property  which  is  present  in  formulation  Tq. 
Although  this  formulation  contains  a relatively  large  number  of  constraints,  most  of  these 
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constraints  are  valid  cuts.  As  a result,  we  expect  this  formulation  to  develop  good  lower 
bounds  for  a branching  scheme. 

2.3.3  Continuous  Formulations 

Every  mathematical  formulation  that  is  not  continuous  because  of  integer  zero-one  vari- 
ables X{  can  be  made  continuous  by  relaxing  the  integrality  constraints,  replacing  them 
with  zero-one  box  constraints  and  adding  to  the  objective  function  a large  penalty  term 
corresponding  to  violated  integer  constraints.  This  is  usually  done  by  adding  a large  M 
term,  if  we  are  dealing  with  a minimization  problem,  of  the  form  x^M{l  — x).  Naturally, 
the  question  now  is  how  large  should  M be?  How  difficult  is  it  to  deal  with  a very  large  M 
factor  in  the  minimization  process?  In  general,  these  questions  are  very  difficult  to  answer, 
see  Pardalos  and  Rosen  [114].  Here,  we  present  a practical  continuous  formulation  in  the 
context  of  continuous  approaches  to  the  SPG.  However,  in  this  formulation,  the  source  of 
concavity  is  stiU  attached  to  the  objective  function.  Indeed,  we  propose  a series  of  continu- 
ous 6-concave  optimization  problems,  and  we  prove  that  the  limit  of  the  series  when  e goes 
to  zero  is  the  fixed  charge  formulation  of  the  SPDG,  see  Guisewite  and  Pardalos  [69].  Also, 
we  give  an  error  bound  on  approximating  the  solution  of  the  SPDG  by  the  solution  of  a 
sequence  in  the  series. 

An  6- Concave  Optimization  Annroach 

Assume  we  are  given  a directed  graph  G = Let  7^  be  a subset  of  nodes  on 

the  graph,  set  of  regular  nodes.  Furthermore,  let  node  r,  the  root  node,  be  an  arbitrary 
node  in  TZ,  In  addition,  let  yij  be  a flow  variable  that  is  associated  with  arc  (i,  j).  Moreover, 
let  6 be  a strictly  positive  real  number.  Now,  define  ^ concave  minimization  problem. 


as  follows. 
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minimize  fs(e,Y)  = Y]  aj — ^ 
subject  to:  AY  = B 

y > 0. 

where  A is  an  |A/"|  X \A^\  node- arc  incidence  matrix,  and  Y is  an  \A^\  X 1 vector  whose 
entries  are  the  y^jS,  and  B is  an  |A/"|  X 1 vector  whose  entries  bi  correspond  to  the  set  of 
nodes;  furthermore,  bi  ==  0 if  i G A/*\7^,  and  b{  = -1  if  i G ^\{^},  and  br  = \TZ\  - 1.  This  is 
basically  a minimum  cost  flow  problem  where  the  objective  function  is  a separable  concave 
function.  Note  that  due  to  the  strict  concavity  of  fe{Y)^  the  unimodularity  of  A and  the 
integrahty  of  5,  the  optimal  solution  to  is  always  integer. 

y.  . 

Now,  if  we  let  Xij  = — - — , the  hmit  of  when  e goes  to  zero  becomes 

Vij  4“  ^ 

(/■g)  minimize  fs{Y)  = CijXij 

subject  to:  AY  = B 

Xij  = 0(1)  for  yij  - 0(>  0)  (V(i,i)  € 

Y >0. 

Observe  that  Ts  is  the  flxed  charge  formulation  of  the  SPG. 
c-Error  Bound 

Here,  we  give  an  e-error  bound  on  the  approximation  of  the  solution  of  the  SPDG  by 
the  solution  of  one  sequence  of  the  series  of  concave  optimization  problems  given  above. 


Theorem  2.1:  The  error  on  approximating  /g  by  /^(cq)  is  less  than 

Proof 

Since  /s(e,y)  is  a convex  and  decreasing  function  of  e,  we  have 


^0 

1 + ^0 
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Conclusions 


In  this  chapter,  we  have  presented  eight  different  formulations  for  the  SPDG  and  the 
SPG.  Those  formulations  faU  into  three  categories,  namely  mixed  integer,  integer  and  con- 
tinuous categories.  Under  the  integer  category,  we  have  proposed  two  new  formulations. 
The  first  one,  formulation  is  a rooted  tree  formulation.  This  formulation  is  used  to 
develop  lower  bounds  for  a branching  scheme  proposed  in  chapter  4 and  to  devise  a test 
problem  generator  in  chapter  5.  The  second  proposed  integer  formulation,  formulation  ^7, 
is  larger  in  size.  However,  the  additional  variables  and  constraints  that  are  involved  ex- 
press vahd  cuts.  Therefore,  this  formulation  can  be  used  to  develop  good  lower  bounds  for 
a branch-and-bound  algorithm.  In  a totally  new  approach,  we  have  proposed  a practical 
continuous  formulation  which  consists  of  approximating  the  SPDG  by  a sequence  in  a series 
of  network  optimization  problems  with  separable  concave  objective  functions  depending 
on  a parameter  c.  This  new  formulation  allows  the  SPG  to  be  treated  with  continuous 
combinatorial  techniques. 

Every  one  of  the  formulations  presented  in  this  chapter  is  vahd.  However,  some  formula- 
tions are  better  than  others  due  to  a better  distribution  of  bad  and  good  properties  between 
the  set  of  constraints  and  the  objective  function.  It  is  weU  known  that  manipulating  an  op- 
timization model  can  be  done  by  either  manipulating  the  constraint  set  or  manipulating  the 
objective  function.  Sometimes,  a formulation  handles  good  optimahty  properties  through 
its  objective  function.  In  this  case,  it  is  a good  idea  to  express  those  properties  in  the 
feasible  domain  by  adding  more  constraints  to  the  problem  to  further  tighten  the  feasible 
region.  An  alternative  situation  is  when  the  set  of  constraints  has  bad  properties  while  the 
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objective  function  has  nice  properties.  It  would  be  a good  idea  in  this  case  to  take  some  of 
the  bad  properties  out  of  the  constraint  set  and  express  them  in  the  objective  function.  To 
make  a long  story  short,  mathematical  models  are  better  off  with  better  shared  information 
between  the  objective  function  and  the  set  of  constraints.  Criteria  to  identify  better  models 
and  systematic  ways  to  formulate  them  are,  however,  open  questions. 


CHAPTER  3 


A HEURISTIC  FOR  THE  STEINER  PROBLEM  IN  GRAPHS 

3.1  Introduction 

As  mentioned  in  chapter  1,  the  Steiner  problem  in  graphs  (SPG)  is  NP-hard.  Therefore, 
developing  heuristics  to  find  approximate  solutions  to  the  problem  becomes  of  importance 
from  a practical  point  of  view.  In  section  3.2,  we  propose  a strongly-polynomial-time  heuris- 
tic where  arcs  are  associated  with  additional  priority  weights  in  order  to  break  ties  among 
alternative  choices  of  shortest  paths  occurring  in  a minimum  spanning  tree  heuristic  ap- 
proach to  solve  the  SPG  based  on  Prim’s  algorithm.  In  section  3.3,  we  study  the  complexity 
of  the  algorithm,  and  we  perform  a worst-case  analysis.  Experimental  results  are  presented 
in  section  3.4,  and  we  conclude  in  section  3.5. 

3.2  A Weighted  Heuristic  Based  on  Prim’s  Algorithm 

Here,  we  present  a polynomial-time  heuristic  for  the  SPG.  In  this  heuristic,  arc  prefer- 
ence weights  are  first  calculated.  Then,  a minimum  spanning  tree  heuristic  approach  based 
on  Prim’s  algorithm  is  employed  to  solve  the  SPG  where  the  calculated  arc  weights  are  used 
to  break  ties  among  shortest  paths  that  are  alternative  choices  for  the  weightless  version  of 
the  heuristic. 

The  procedure  to  find  the  preference  weights  is  a variant  of  the  algorithm.  For  instance, 
if  the  weights  are  chosen  to  be  equal  for  aU  arcs,  then  the  weightless  version  of  the  heuristic 
is  at  hand,  a version  that  is  similar  to  an  algorithm  proposed  by  Takahashi  and  Mutsuyama 
[150].  We  have  studied  the  performance  of  several  weight  versions  of  the  heuristic.  However, 
we  are  going  to  focus  the  scope  of  this  chapter  on  the  most  efficient  weighted  version 
WPSPGH  and  the  weightless  version  PSPGH. 
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Both  WPSPGH  and  PSPGH  maintain  a working  Steiner  subtree  represented  by  two 
working  sets  of  nodes  and  arcs.  This  subtree  contains  at  the  beginning  an  arbitrary  regular 
node  and  no  arcs.  At  every  step,  the  closest  non-subtree  regular  node  to  the  working  Steiner 
subtree  is  first  determined.  Second,  the  shortest  path  to  connect  that  regular  node  to  the 
subtree  is  constructed,  and  the  working  Steiner  subtree  is  updated  to  include  all  nodes  and 
arcs  on  that  shortest  path.  If  there  are  alternative  choices  to  construct  the  shortest  path, 
the  shortest  path  with  the  largest  average  preference  weight,  given  by  the  sum  of  preference 
weights  of  arcs  on  the  shortest  path  divided  by  the  number  of  arcs  on  that  shortest  path, 
prevails.  The  construction  ends  with  a Steiner  tree  after  adding,  one  at  a time,  all  regular 
nodes  to  the  working  Steiner  subtree. 

Although  WPSPGH  and  PSPGH  construct  the  Steiner  tree  in  a similar  fashion,  they 
differ  in  the  preference  weights  used  to  break  ties  among  alternative  choices  for  shortest 
paths.  While  PSPGH  uses  equal  weights  for  all  arcs,  WPSPGH  employs  more  sophisti- 
cated preference  weights.  The  idea  behind  constructing  these  weights  can  be  described  as 
follows.  Every  regular  node  has  an  effect  on  quantifying  the  preference  of  an  arc.  If  a regular 
node  k is  to  be  connected  in  a Steiner  tree  to  another  regular  node  using  a particular  arc 
(i,j)  on  the  connecting  path,  it  is  likely  going  to  be  connected  to  either  regular  nodes  Vi 
or  Tj  that  are  closest  to  nodes  i or  j respectively.  There  are  two  alternatives  to  make  this 
connection.  The  first  alternative  is  to  connect  k to  i,  to  use  (i,j)  and  to  connect  j to  rj. 
The  second  alternative  is  to  connect  k to  j,  to  use  (i,  j)  and  to  connect  i to  r*.  Of  these  two 
alternatives,  the  less  costly  one  is  chosen,  and  the  cost  of  the  chosen  one  is  inversely  scaled 
by  the  cost  of  connecting  k to  the  closest  regular  node  in  to  determine  a positive 

weight  term,  which  is  smaller  than  one,  corresponding  to  the  contribution  of  regular  node 
k to  determining  the  preference  weight  of  arc  Below,  the  weightless  version  of  the 

heuristic  is  given  followed  by  the  weighted  version. 
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Algorithm:  PSPGH 

Input:  G = {JV^A^C),TZ  C J\f. 

Output:  A Steiner  tree. 

BEGIN 

**  Calculate  the  shortest  distances.  ** 

Find  the  shortest  distances  dy.y^  between  V{  and  vj^  \/v{  G 7^,  Vnj  G JV; 

**  Calculate  the  arc  preference  weights.  ** 

V(n^',Uj)  G w4,  set  the  preference  weight  to  1; 

**  Construct  the  Steiner  tree.  ** 

TJV  = {u/c},  where  Vk  is  an  arbitrary  node  in  TZ; 

AfS  = TZ\{vk}; 

TA  = (D; 

WHILE  A/'5  / 0 DO  BEGIN 

determine  G MS^Vn  G TM  such  that  dy^y^  = min{d-i;.^;^ , G AfS^vj  G TAf}; 

find  SP  the  shortest  path  between  Vm  and  that  has  the  largest  average  preference 
weight  given  by  the  sum  of  preference  weights  of  arcs  on  SP  divided  by  the  number 
of  arcs  on  5P; 

determine  SVAfS  and  SVAS  the  set  of  nodes  and  arcs,  respectively,  on  SP\ 

TM  = SVNS\ 

TA  = TA\J  SVAS^, 

ENDDO 


END 
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Algorithm:  WPSPGH 

Input:  G — {J\f ^A^C)^1ZC  M . 

Output:  A Steiner  tree. 

BEGIN 


**  Calculate  the  shortest  distances.  ** 

Find  the  shortest  distances  dy.y^  between  V{  and  Uj,  G 7^,  Vuj  G A/*; 

**  Calculate  the  arc  preference  weights.  ** 

^Vi  G A/",  let  Vi  be  the  closest  node  in  TZ  to  Vi; 

G 7^,  let  r{f  be  the  closest  node  in  7^\{u^}  to 

FORALL  {vi,Vj)  G ^ DO  BEGIN 

IF  (vi  7^  Vj)  THEN 

EdvkTkf 

^ 7^ 1 7’ 

y^^'Jl  ^^^{dvkVi  + dy^rj’fdyj^y^  + dy-j.^} 

ELSE 

let  vi  = Vi  = rj; 


d 


w 


+ E 


virif 


^(vi,Vj)  d"  ^^^{dyj^y-  P dy^j-j^f^  dyj^y^  “f-  dy->p^f^ 

^VkTkf 


Vk 

ENDIF 

ENDDO 


e'R-Xivi}  d"  d^VjTj’fdy^y^  + dy^j.-} 


**  Construct  the  Steiner  tree.  ** 

TM  — where  is  an  arbitrary  node  in  TZ\ 

MS  = 

TA  = 0; 

WHILE  0 DO  BEGIN 

determine  G MS,Vn  G TM  such  that  , Uj-  G MS,Vj  G TM}', 

find  SP  the  shortest  path  between  and  that  has  the  largest  average  preference 
weight  given  by  the  sum  of  preference  weights  of  arcs  on  SP  divided  by  the  number 
of  arcs  on  5P; 

determine  SVMS  and  SVAS  the  set  of  nodes  and  arcs,  respectively,  on  SP\ 

TM  = TMU  SVMS; 

TA^TAli  SVAS; 

MS  = MS\{vm}; 

ENDDO 


END 
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3.3  Complexity  and  Worst-Case  Analysis 

It  is  important  to  study  the  complexity  and  perform  a worst-case  analysis  to  evaluate 
an  algorithm.  AU  different  weight  versions  of  the  heuristic  presented  above  have  three 
time- complexity  terms.  The  first  term  is  due  to  finding  the  shortest  distances  between 
every  regular  node  and  all  nodes.  This  can  be  carried  out  in  both  versions  of  the  heuristic 
in  0{\TZ\  X |A/*P)  time  using  Dijkstra’s  algorithm.  The  second  term  is  due  to  finding  the 
preference  weights.  In  the  case  of  algorithm  PSPGH,  this  is  done  in  0{\A\)  time.  In 
WPSPGH,  this  operation  is  carried  out  in  0(|^|  X |7^|)  time.  The  last  term  contributing 
to  time  complexity  is  the  same  for  both  algorithms,  and  it  is  due  to  constructing  the 
Steiner  tree  by  adding  one  path  segment  at  a time.  Since  the  number  of  Steiner  nodes, 
optional  nodes  in  the  Steiner  tree  with  degree  more  than  two,  is  0(|7^|),  the  number  of 
added  path  segments  is  0{\TZ\).  The  time  complexity  involved  in  finding  every  added 
path  segment  is  due  to  finding  the  shortest  path  between  two  nodes  which  has  the  largest 
average  preference  weight.  This  operation  can  be  done  in  0(|A/*p)  time  by  using  a modified 
Dijkstra’s  algorithm  where  nodes  are  associated  with  four  physical  entities  instead  of  two. 
The  first  two  entities  correspond  to  the  cost  of  the  shortest  path  from  the  root  node  and  the 
index  of  the  predecessor  node  on  that  path.  The  last  two  entities  correspond  to  the  number 
of  arcs  on  that  path  and  the  sum  of  the  preference  weights  corresponding  to  these  arcs. 
During  the  course  of  the  algorithm,  a node  is  labeled  if  it  has  the  largest  average  preference 
weight,  given  by  the  entity  corresponding  to  the  sum  of  the  preference  weights  divided  by 
the  entity  corresponding  to  the  number  of  arcs,  among  the  unlabeled  nodes  which  have  the 
smallest  path  cost  from  the  root  node.  As  a result,  both  algorithms  PSPGH  and  WPSPGH 
have  a total  time  complexity,  due  to  the  three  terms  discussed  above,  of  0{\TZ\  X |Vp). 

By  analyzing  the  worst  case  performance  of  an  algorithm,  one  can  gain  some  insight  of 
the  weakness  and  strength  of  an  algorithm.  The  worst  case  assumed  by  algorithm  PSPGH  is 
the  same  for  all  SPG  heuristics  that  are  based  on  a minimum  spanning  tree  approach.  The 
reader  is  referred  to  Du  [36]  for  a survey  of  Steiner  minimum  tree  worst-case  approximations. 


39 


It  is  known  that  the  length  of  a Steiner  tree  L{MST)  found  by  a minimum-spanning-tree- 
based  heuristic  is  bounded  by 

L{MST)  < 2 * L{SMT)  - Ip,  (3.1) 

where  L{SMT)  is  the  length  of  the  Steiner  minimum  tree,  and  Ip  is  the  length  of  the 
longest  path  between  two  consecutive  regular  nodes  on  the  Steiner  minimum  tree.  In  fact, 
this  bound  is  tight  in  the  case  of  the  wheel  graph  shown  in  Figure  3.1.  In  this  graph,  the 
only  optional  node  is  the  center  of  the  wheel,  and  arcs  between  two  consecutive  nodes  on 
the  exterior  of  the  wheel  have  a cost  of  two,  and  arcs  joining  the  external  nodes  to  the  center 
have  a cost  of  one.  While  the  Steiner  minimum  tree  of  this  graph,  which  is  the  star  that 
connects  the  exterior  regular  nodes  to  the  center  of  the  wheel,  is  of  cost  4,  the  worst-case 
Steiner  tree  found  by  PSPGH,  which  connects  the  regular  nodes  on  the  boundary  without 
using  the  optional  center  node,  is  of  cost  6.  In  this  case,  the  worst  case  bound  given  above 
is  tight. 

If  applied  to  the  graph  given  in  Figure  3.1,  WPSPGH  associates  higher  preference 
weights  with  arcs  between  the  regular  nodes  on  the  boundary  and  the  center  nodes.  In 
fact,  arcs  between  nodes  on  the  boundary  have  preference  weights  equal  to  3 while  arcs 
between  nodes  on  the  boundary  and  the  center  node  have  preference  weights  equal  to  4. 
As  a result,  the  algorithm  wiU  take  the  alternative  shortest  path  that  goes  through  the 
center,  and  the  Steiner  tree  found  by  WPSPGH  is  eventually  the  optimal  solution.  How- 
ever, this  optimal  choice  is  highly  sensitive  to  the  arc  costs.  For  instance,  if  we  perturb 
the  cost  of  all  arcs  incident  to  the  center  node  by  a very  small  positive  number  e as  shown 
in  Figure  3.2,  then  there  is  only  one  choice  for  every  shortest  path  between  two  regular 
nodes,  and  WPSPGH  will  find  the  worst-case  Steiner  tree  found  by  PSPGH  above.  At  the 
limit  when  e goes  to  zero,  the  worst  case  bound  is  given  by  (3.1).  Hence,  in  the  general 
graph  case,  there  is  no  improvement  of  WPSPGH  over  PSPGH  in  the  worst  case  analy- 
sis. However,  in  some  cases  of  more  structured  graphs  fike  grid  graphs  where  alternative 
choices  for  shortest  paths  always  exist,  pathological  cases  of  high  sensitivity  like  the  one 
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Figure  3.1.  A Steiner  problem  on  a wheel  graph  where  fiUed  circles  are  regular  nodes. 

presented  above  do  not  exist.  Therefore,  we  conjecture  that  the  worst-case  bound  corre- 
sponding to  WPSPGH  is  better  than  the  worst-case  bound  corresponding  to  PSPGH,  and 
to  aU  minimum-spanning-tree-based  heuristics  for  the  Steiner  problem  in  rectilinear  graphs 
(SPRG),  given  by  L{SMT)  * 3/2  [74]. 

\ 

3.4  Experimental  Results 

Although  the  worst-case  analysis  is  of  importance  to  understand  the  inherit  complexity 
of  the  problem,  evaluating  the  average  computational  performance  of  the  algorithm  on  a 
variety  of  problem  instances  is  of  crucial  importance.  Codes  PSPGH  and  WPSPGH  have 
been  written  in  FORTRAN  77  and  run  on  a SUN  SPARC  workstation.  The  problems  used 
to  test  the  performance  of  the  algorithms  form  a variety  of  problems.  These  test  problems 
range  from  rectilinear  grid  graphs  to  different  types  of  general  graphs. 

On  grid  graphs,  PSPGH  and  WPSPGH  have  been  tested  using  the  Soukup  and  Chow 
[147]  test  problems  for  which  Beasley  [11]  reported  solutions  found  by  his  state  of  the 
art  algorithm  for  the  SPRG;  this  algorithm  employs  local  optimization  based  on  a four- 
point  Steinerization  technique.  The  performance  of  the  heuristics  is  reported  in  Table  3.1. 
Problem  numbers  and  the  corresponding  numbers  of  regular  nodes  distributed  in  the  unit 
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Figure  3.2.  A Steiner  problem  on  a wheel  graph  with  perturbed  costs  where  fiUed  circles 
are  regular  nodes. 

square  are  given  in  the  first  and  second  columns  respectively.  Costs  of  Steiner  trees  which 
are  found  by  PSPGH,  PWSPGH  and  Beasley’s  algorithm  are  given  in  the  third,  fourth 
and  fifth  columns  respectively.  Optimal  solutions,  if  known,  are  given  in  column  six.  As 
can  be  seen  from  Table  3.1,  WPSPGH  is  competitive  with  Beasley’s  algorithm.  Eventually, 
WPSPGH  finds  better  solutions  for  some  problems  for  which  the  optimal  solution  is  not 
known.  In  addition,  WPSPGH  finds  better  solutions  than  PSPGH  for  22  problems  out  of 
the  46  test  problems.  In  some  problems,  the  improvement  over  solutions  found  by  PSPGH 
is  about  15%.  Out  of  the  remaining  24  problems,  PSPGH  finds  slightly  better  solutions  for 
only  two  problems,  a fact  due  to  pure  luck  since  PSPGH  does  not  have  any  systematic  way 
to  choose  among  alternative  shortest  paths  during  the  construction  of  the  Steiner  tree. 

Table  3.2,  Table  3.3,  Table  3.4  and  Table  3.5  report  the  performance  of  the  heuristic  on 
general  graph  problems.  The  problem  number  is  given  in  column  1.  The  number  of  nodes, 
arcs  and  regular  nodes  in  the  graph  are  given  in  columns  2,3  and  4 respectively.  Steiner  tree 
costs  found  by  PSPGH  and  WPSPGH  are  given  in  columns  5 and  6 respectively.  Problem 
sets  corresponding  to  Table  3.2  and  Table  3.3,  obtained  from  Beasley’s  hbrary  for  test 
problems  [10],  and  problems  corresponding  to  Table  3.4,  constructed  by  the  test-problem 
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generator  for  the  SPG  proposed  in  chapter  5,  are  examples  of  general  graph  Steiner  problems 
where  WPSPGH  does  little.  In  fact,  WPSPGH  improves  over  the  solution  of  PSPGH  in 
only  1 problem  out  of  18  in  Table  3.2,  1 problem  out  of  15  in  Table  3.3  and  none  of  the 
problems  in  Table  3.4.  This  is  merely  due  to  the  fact  that  in  these  problems  there  are  not 
many  choices  for  alternative  shortest  paths  between  nodes  in  the  graph.  To  highhght  this 
fact,  we  have  tested  both  weighted  versions  of  the  heuristic  on  a set  of  random  general 
graphs  which  have  the  property  that  alternative  choices  for  shortest  paths  between  nodes 
in  the  graph  exist.  Clearly,  Table  3.5  suggests  that  WPSPGH  is  superior  to  PSPGH  in  this 
situation  where  it  finds  better  solutions  in  12  problems  out  of  20. 

3.5  Conclusions 

In  this  chapter,  we  have  presented  a weighted  heuristic  for  the  Steiner  problem  in  graphs 
based  on  Prim’s  algorithm  for  the  minimum  spanning  tree.  In  this  heuristic  preference 
weights  associated  with  arcs  are  used  to  break  ties  among  alternative  choices  of  shortest 
paths  occurring  during  the  course  of  the  algorithm  which  constructs  the  Steiner  tree  by 
adding  one  path  segment  at  a time.  We  analyzed  the  performance  of  two  weight  versions, 
PSPGH  and  WPSPGH,  of  the  heuristic.  In  the  first  version,  arcs  are  associated  with  equal 
preference  weights,  a case  that  is  similar  to  an  algorithm  that  is  known  in  the  literature. 
The  second  version  is  the  most  efficient  among  all  the  different  weight  versions  we  have 
studied.  Experimental  results  have  shown  that  for  a class  of  graphs  having  the  property 
that  alternative  choices  for  shortest  paths  between  nodes  in  the  graph  exist,  WPSPGH 
performs  better  than  PSPGH.  Moreover,  in  rectihnear  grid  graphs,  a type  of  graphs  that 
possesses  the  property  mentioned  above,  not  only  does  WPSPGH  perform  better  than 
PSPGH,  but  also  it  is  competitive  with  the  state  of  the  art  algorithm  designed  specifically 
for  rectilinear  grid  graphs.  To  summarize,  the  weighted  version  of  the  heuristic  presented 
in  this  chapter  proves  to  be  efficient.  However,  the  search  for  weight  versions  that  are  more 
efficient  is  open. 
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In  the  next  chapter,  we  will  use  the  heuristic  presented  in  this  chapter  to  find  a feasible 
integer  solution  which  provides  a starting  upper  bound  for  a proposed  branch-and-bound 
algorithm. 
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Table  3.1.  Performance  on  SPRG  problems. 


Prob.  # I |7^|  I PSPGH  I WPSPGH  I BSPRGH  I OPTIMAL 


1 

5 

1.87 

1.87 

1.87 

1.87 

2 

6 

1.83 

1.68 

1.68 

1.64 

3 

7 

2.59 

2.36 

2.36 

2.36 

4 

8 

2.62 

2.54 

2.54 

2.54 

5 

6 

2.29 

2.29 

2.29 

2.26 

6 

12 

2.68 

2.58 

2.48 

2.42 

7 

12 

2.74 

2.54 

2.54 

2.48 

8 

12 

2.52 

2.52 

2.42 

2.36 

9 

7 

1.64 

1.64 

1.72 

1.64 

10 

6 

1.84 

1.84 

1.84 

1.77 

11 

6 

1.52 

1.44 

1.44 

1.44 

12 

9 

1.9 

1.8 

1.8 

1.8 

13 

9 

1.5 

1.5 

1.5 

1.5 

14 

12 

2.6 

2.6 

2.6 

2.6 

15 

14 

1.55 

1.48 

1.48 

? 

• 

16 

3 

1.6 

1.6 

1.6 

1.6 

17 

10 

2.0 

2.0 

2.01 

2.0 

18 

62 

4.27 

4.24 

4.06 

? 

• 

19 

14 

1.92 

1.93 

1.9 

? 

• 

20 

3 

1.12 

1.12 

1.12 

1.12 

21 

5 

2.3 

1.96 

2.16 

1.92 

22 

4 

0.63 

0.63 

0.63 

0.63 

23 

4 

0.65 

0.65 

0.65 

0.65 

24 

4 

0.33 

0.3 

0.3 

0.3 

25 

3 

0.23 

0.23 

0.23 

0.23 

26 

3 

0.15 

0.15 

0.15 

0.15 

27 

4 

1.43 

1.33 

1.33 

1.33 

28 

4 

0.28 

0.24 

0.24 

0.24 

29 

3 

2.1 

2.0 

2.0 

2.0 

30 

12 

1.1 

1.1 

1.1 

1.1 

31 

14 

2.71 

2.71 

2.6 

? 

• 

32 

19 

3.3 

3.3 

3.23 

? 

• 

33 

18 

2.83 

2.73 

2.69 

? 

• 

34 

19 

2.61 

2.61 

2.54 

? 

• 

35 

18 

1.62 

1.58 

1.54 

? 

• 

36 

4 

0.9 

0.9 

0.9 

0.9 

37 

8 

0.93 

0.9 

0.9 

0.9 

38 

14 

1.66 

1.66 

1.66 

? 

• 

39 

14 

1.68 

1.66 

1.66 

? 

• 

40 

10 

1.63 

1.63 

1.62 

1.55 

41 

20 

2.43 

2.35 

2.24 

? 

• 

42 

15 

1.61 

1.56 

1.53 

1.53 

43 

16 

2.67 

2.72 

2.66 

? 

• 

44 

17 

2.6 

2.56 

2.61 

? 

• 

45 

19 

2.28 

2.24 

2.26 

? 

• 

46 

16 

1.5 

1.5 

1.5 

1.5 

45 


Table  3.2.  Performance  on  set  steinb  of  SPG  problems. 


Prob.  # I lA/l  I I |7^|  I PSPGH  I WPSPGH  I OPTIMAL 


5tr 

6T 

9 

W 

55“ 

m 

b2 

50 

63 

13 

88 

83 

83 

b3 

50 

63 

25 

138 

138 

138 

b4 

50 

100 

9 

59 

59 

59 

b5 

50 

100 

13 

62 

62 

61 

b6 

50 

100 

25 

127 

127 

122 

b7 

75 

94 

13 

111 

111 

111 

b8 

75 

94 

19 

104 

104 

104 

b9 

75 

94 

38 

220 

220 

220 

blO 

75 

150 

13 

87 

87 

86 

bll 

75 

150 

19 

92 

92 

88 

bl2 

75 

150 

38 

174 

174 

174 

bl3 

100 

125 

17 

178 

178 

165 

bl4 

100 

125 

25 

245 

245 

235 

bl5 

100 

125 

50 

322 

322 

318 

bl6 

100 

200 

17 

133 

133 

127 

bl7 

100 

200 

25 

132 

132 

131 

bl8 

100 

200 

50 

222 

222 

218 

Table  3.3.  Performance  on  set  steinc  of  SPG  problems. 


Prob.  # I lA/l  I \A\  I |7^|  I PSPGH  I WPSPGH  I OPTIMAL 


cl 

500 

625 

5 

85 

85 

85 

c2 

500 

625 

10 

144 

144 

144 

c3 

500 

625 

83 

766 

766 

754 

c4 

500 

625 

125 

1095 

1095 

1079 

c5 

500 

625 

250 

1592 

1592 

1579 

c6 

500 

1000 

5 

56 

56 

55 

c7 

500 

1000 

10 

102 

102 

102 

c8 

500 

1000 

83 

529 

529 

509 

c9 

500 

1000 

125 

726 

726 

707 

clO 

500 

1000 

250 

1110 

1110 

1093 

cll 

500 

2500 

5 

34 

32 

32 

cl2 

500 

2500 

10 

48 

48 

46 

cl3 

500 

2500 

83 

267 

267 

258 

cl4 

500 

2500 

125 

339 

339 

323 

cl5 

500 

2500 

250 

564 

564 

556 

i 
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Table  3.4.  Performance  on  set  spgtp  of  SPG  problems. 


Prob.  # 

N 

A 

7Z 

PSPGH 

WPSPGH 

OPTIMAL 

spgtp 1 

20 

40 

5 

986 

986 

986 

spgtp2 

20 

40 

5 

3562 

3562 

3562 

spgtp3 

20 

40 

7 

2251 

2251 

2213 

spgtp4 

20 

40 

8 

2982 

2982 

2864 

spgtp5 

20 

40 

10 

2790 

2790 

2776 

Table  3.5.  Performance  on  set  spg  of  SPG  problems. 


Prob.  # I lA/l  I 1^1  I \R\  PSPGH  I WPSPGH 


spgl 

100 

500 

5 

26 

26 

spg2 

100 

1000 

5 

41 

38 

spg3 

100 

2000 

5 

41 

41 

spg4 

100 

3000 

5 

17 

17 

spg5 

100 

4000 

5 

17 

17 

spg6 

100 

500 

20 

75 

74 

spg7 

100 

1000 

20 

80 

79 

spg8 

100 

2000 

20 

105 

103 

spg9 

100 

3000 

20 

101 

99 

spglO 

100 

4000 

20 

133 

131 

spgll 

100 

500 

40 

128 

128 

spgl2 

100 

1000 

40 

144 

144 

spgl3 

100 

2000 

40 

153 

150 

spgl4 

100 

3000 

40 

168 

156 

spgl5 

100 

4000 

40 

163 

158 

spgl6 

100 

500 

60 

197 

197 

spgl7 

100 

1000 

60 

188 

188 

spgl8 

100 

2000 

60 

201 

192 

spgl9 

100 

3000 

60 

208 

194 

spg20 

100 

4000 

60 

221 

195 

CHAPTER  4 


A BRANCH  AND  BOUND  ALGORITHM  FOR  THE  STEINER  PROBLEM  IN  GRAPHS 

4T  Introduction 

All  the  known  exact  approaches  to  the  Steiner  problem  in  graphs  (SPG)  are  based  on  a 
kind  of  enumeration  technique.  Some  of  these  approaches  are  smarter  than  others  in  terms 
of  avoiding  total  enumeration  by  using  properties  of  optimality  or  by  using  good  lower  and 
upper  bounds  to  reduce  the  search  space  in  size.  One  of  those  smart  approaches  is  the  branch 
and  bound  procedure  where  nodes  in  the  binary  search  tree  correspond  to  binary  branching 
variables.  For  any  branch  and  bound  procedure,  the  choices  of  preprocessing  strategies, 
lower  bound  generators  and  upper  bounds  are  crucial  in  order  to  reduce  the  total  number 
of  nodes  in  the  search  tree.  In  the  second  section  we  propose  a set  of  reduction  techniques 
in  order  to  preprocess  the  original  graph.  A starting  upper  bound  is  proposed  in  the  third 
section.  In  the  fourth  section,  a lower  bound  generator  is  proposed.  In  section  4.5,  branching 
strategies  are  discussed.  A branch  and  bound  algorithm  is  presented  in  section  4.6.  In 
section  4.7,  we  report  experimental  results,  and  we  conclude  in  section  4.8. 

4.2  Graph  Preprocessing 

As  experience  has  shown,  preprocessing  an  integer  program  reduces  the  computational 
burden  on  an  exact-solution  procedure.  Given  a graph  G = (AA,  M,C)  and  a subset  of 
nodes  7^,  we  present  in  the  following  three  reduction  rules  for  the  SPG  which  are  easy  to 
implement. 

Edge  Contraction 

Edge  contraction  can  be  performed  in  two  cases.  The  first  case  corresponds  to  the 
situation  where  the  degree  of  a regular  node  i is  one.  In  such  a case,  the  only  arc  (i,j) 
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incident  to  i has  to  be  in  any  solution  tree.  As  a result,  (i,j)  can  be  contracted.  In  the 
second  case,  let  i and  j be  two  regular  nodes  of  TZ.  If  C{j  < Cik  for  all  nodes  k adjacent  to  i, 
then  it  is  easy  to  see  that  arc  (i,  j)  is  a part  of  an  optimal  solution.  Therefore,  edge  (i^j) 
can  be  contracted.  After  the  contraction,  if  parallel  edges  or  arcs  occur,  only  the  edge  with 
the  smallest  cost  is  retained. 

Edge  Deletion 

Suppose  there  exist  k £ TZ  and  i,  j G Af  such  that  Cij  > maxlq/^, Cj/^}.  Then  edge  (^,J) 
can  be  deleted  from  the  graph  for  the  simple  reason  that  it  wiU  never  be  a part  of  an  optimal 
solution.  In  fact,  suppose  that  an  optimal  solution  contains  arc  (i,  j),  then  we  can  make  an 
improvement  by  replacing  (z,  j)  by  either  (i^k)  or  (j, A;),  a contradiction. 

Node  Deletion 

There  are  two  situations  where  nodes  not  in  TZ  can  be  deleted.  The  first  case  is  when 
the  degree  of  the  node  is  equal  to  one,  an  obvious  case  since  this  node  will  never  be  in  a 
Steiner  tree.  In  the  second  situation,  the  degree  of  the  node  is  equal  to  two.  In  such  a 
case,  if  the  node  is  to  be  in  a Steiner  tree,  both  incident  arcs  have  to  be  used.  Therefore, 
in  a reduced  equivalent  graph,  this  node  can  be  deleted  and  its  two  incident  arcs  can  be 
replaced  by  a single  arc  of  cost  equal  to  the  total  of  their  costs  and  with  incident  nodes  the 
two  adjacent  nodes  of  the  node  in  question;  if  parallel  edges  occur  in  this  process,  only  the 
edge  with  the  smallest  cost  is  kept. 

4.3  An  Upper  Bound 

A good  starting  feasible  solution  provides  a good  upper  bound  which  combined  with 
good  lower  bounds  causes  early  node  pruning  in  the  search  tree,  an  essential  objective.  In 
the  course  of  the  branch  and  bound  algorithm,  the  upper  bound  is  updated  whenever  a 
better  integer  feasible  solution  is  identified.  Usually,  the  starting  upper  bound  is  provided 
by  a heuristic.  For  this  purpose,  we  use  the  polynomial-time  heuristic  WPSPGH  proposed 
in  chapter  3. 
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4.4  A Lower  Bound  Generator 

A good  technique  to  generate  lower  bounds  is  necessary  in  order  to  reduce  the  possible 
search  space.  In  this  section,  we  propose  a lower  bound  obtained  by  relaxing  the  integrality 
constraints  in  one  of  the  integer  formulations  proposed  in  chapter  2.  Given  a graph  G = 
(AT, w4,C),  construct  its  corresponding  symmetric  directed  graph  TZC  M 

and  r G 7^  as  mentioned  in  chapter  2.  Consider  the  following  variant  of  formulation 


minimize  ^ CijXij 

(4.1) 

subject  to:  ^ X{j 

= 1 

(Vj  G 7^\{r}) 

(4.2) 

(iJ)er(i) 

^ ^ij 

(V(i,i)G  V\0(r)) 

(4.3) 

Vi  - Vj  + W\xij 

< 1 VI  - 1 

(V(i,i)G^") 

(4.4) 

Xij  e {0, 1} 

(V(i,i)€  V) 

(4.5) 

Vi 

> 0 

(Vi  G V). 

(4.6) 

Involving  additional  nonnegative  variables  corresponding  to  nodes,  constraint  (4.4)  elimi- 
nates directed  cycles,  which  correspond  to  tours  and  subtours;  this  type  of  constraint  is 
used  in  formulating  the  traveling  salesman  problem  [109,  95].  Let  CVe  be  the  linear  pro- 
gram obtained  lorm  J-q  by  relaxing  the  integrality  constraint.  In  the  process  of  generating 
lower  bounds  for  the  branch  and  bound  algorithm  proposed  in  section  4.6,  every  node  in  the 
search  tree  is  associated  with  a pair  of  arc  subsets  {Aq^Ai)  and  a lower  bound  LB^{Aq^A\) 
which  is  the  optimal  objective  value  of  CVe  where  the  binary  variables  corresponding  to 
arcs  in  ^nd  Ai  are  fixed  to  zero  and  one  respectively. 

The  choice  of  the  Steiner  tree  root  node  r used  in  the  mathematical  model  above  has  an 
effect  on  the  quality  of  the  lower  bound.  One  way  to  determine  the  root  node  is  to  choose 
among  the  \TZ\  regular  nodes  the  root  node  that  has  the  largest  LP-relaxation  objective 
value  to  provide  the  largest  starting  lower  bound.  Another  way  to  determine  the  root  node 
is  to  choose  among  the  regular  graph  nodes  the  one  that  has  the  largest  average  preference 
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weights  found  by  dividing  the  sum  of  preference  weights^  for  arcs  emanating  from  the  node 
by  the  number  of  these  arcs.  For  the  purpose  of  efficiency,  the  second  alternative  to  choose 
the  root  node  is  adopted  for  the  branch  and  bound  algorithm. 

4.5  Branching  Strategies 

In  a branch-and-bound  scheme,  there  are  two  different  aspects  of  the  branching  strategy 
related  to  constructing  and  exploring  the  search  tree. 

Choosing  the  Branching  Variable 

In  constructing  the  search  tree,  choosing  the  branching  variable  at  a parent  node  has  to 
be  done  in  a systematic  way.  One  alternative  to  choose  the  branching  variable  is  to  choose 
the  variable  with  value  above  a threshold  value  close  to  one  and  whose  corresponding  arc 
has  the  smallest  depth. ^ With  this  rule,  good  solutions  are  found  relatively  fast.  However, 
to  prove  optimality,  it  takes  a very  long  time.  The  other  way  to  choose  the  branching 
variable  is  to  take  the  smallest-depth  arc  whose  value  is  within  an  interval  centered  at 

0. 5  and  length  equal  to  twice  a threshold  value,  a value  that  is  close  to  0.1  and  could  be 
increased  if  needed.  With  this  rule,  it  takes  time  to  find  good  solutions;  however,  it  takes 
less  time,  relatively  speaking,  to  prove  optimality.  Due  to  the  good  quality  of  the  starting 
upper  bound  solution,  the  second  alternative  is  adopted  for  the  sake  of  efficiency  in  proving 
optimality. 

Exploring  the  Search  Tree 

There  are  several  alternatives  to  deal  with  the  second  aspect  of  the  branching  strategy, 

1. e,  exploring  the  search  tree.  Examples  of  such  alternatives  are  breadth-first  search,  best- 

first  search  and  depth-first  search.  The  breadth-first  search  and  best-first  search  require 

more  storage  space.  However,  they  explore  the  search  tree  in  a parallel  fashion  making  the 

chance  of  finding  good  solutions  better  which  in  turn  eliminates  exploring  bad  branches  in 
^The  preference  weights  in  question  are  mentioned  in  chapter  3. 

The  depth  of  an  arc  is  defined  as  the  smallest  number  of  arcs  it  takes  to  reach  the  beginning  node  of 
that  arc  from  the  root  node. 
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the  search  tree.  Depth-First  search,  on  the  other  hand,  requires  much  less  memory  storage 
space,  at  the  expense  of  increasing  the  chance  of  exploring  bad  branches  in  the  search  tree. 
With  a good  starting  upper  bound  hke  the  one  we  proposed  in  section  4.3,  this  chance  of 
exploring  bad  branches  in  the  search  tree  is  reduced,  and  the  depth-first  search  becomes  an 
effective  strategy  for  the  branch-and-bound  algorithm  proposed  in  the  following  section. 

4.6  A Branch  and  Bound  Algorithm 

In  this  section,  we  propose  a branch  and  bound  algorithm  for  the  SPG  which  maintains 
a branching  binary  tree.  Consider  a graph  G = (Af^A^C)  and  its  corresponding  symmetric 
directed  graph  obtained  as  mentioned  in  chapter  2.  Every  node  i in  the 

branching  tree  is  associated  with  a set  of  directed  arcs  G A^  whose  variables  are  fixed 
to  zero,  a set  of  directed  arcs  A\  G A^  whose  variables  are  fixed  to  one,  a lower  bound 
LBq^Aq^Ai)  obtained  as  mentioned  in  the  fourth  section  and  a branching  arc  variable 
obtained  as  mentioned  in  the  fifth  section.  The  algorithm  explores  the  set  of  active  nodes, 
leaf  nodes  in  the  branching  tree  whose  lower  bounds  are  less  than  the  best  known  upper 
bound,  in  a depth-first-search  manner.  In  this  fashion,  the  upbranch  at  a node  is  completely 
exhausted  before  the  downbranch  at  the  same  node  is  considered.  We  chose  this  way  to 
explore  the  search  tree  because  it  is  very  efficient  in  terms  of  memory  space  requirements 
and  because  we  start  with  a very  good  upper  bound  which  reduces  the  need  for  exploring 
the  search  tree  in  parallel  fashion. 

The  active  nodes  are  sorted  in  a stack  according  to  a selection  rule,  which  is  the  depth- 
first-search  rule  in  this  case.  The  state  of  the  branching  tree  is  represented  by  six  variables: 
BBN ^ BBAN ^ BBCN^  BBBV{i)^  BBLB{i)  and  BBP(^i).  BBN  is  the  total  number  of 
nodes  in  the  search  tree.  BBAN  is  the  total  number  of  active  nodes  in  the  stack.  BBCN 
is  the  current  node  to  be  analyzed.  BBBV (i)  contains  the  index  of  the  branching  variable 
associated  with  node  i in  the  branching  tree.  BBLB(i)  contains  the  lower  bound  value 
associated  with  node  i in  the  search  tree.  BBP(i)  points  out  to  the  parent  node  of  node  i 
in  the  branching  tree;  if  BBP{i)  = j > 0 or  BBP{i)  = -j  < 0,  then  i is  an  upbranch  or 
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downbranch,  respectively,  of  j.  In  what  follows,  we  propose  an  algorithm  for  the  SPG. 


SPG  Algorithm 

Given:  A graph  G = (A/*,  A,C)  and  a subset  of  regular  nodes  IZ. 

Output:  A Steiner  minimum  tree. 

Step  0 

• Do  graph  preprocessing  (see  section  4.2) 

• Find  a starting  upper  bound  ub  (see  section  4.3). 

• Find  a starting  lower  bound  lb  = see  section  4.4. 

• If  lb  is  equal  to  ub  or  if  the  starting  lower  bound  solution  is  integer,  then  stop  an 
optimal  solution  with  value  lb  is  at  hand. 

• Create  the  first  active  node  as  follows:  BBN  = l^BBAN  — 1-^  Aq  = A\  = 0; 
BBLB(1)  — lb]  BBP(l)  = 0;  choose  the  branching  variable  bv  (see  section  4.5), 
BBBV{1)  = bv]  BBCN  = 1. 

• Go  to  step  1. 
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Step  1:  Upbranch 

• Let  k = BBC  IS,  and  let  {i,j)  BBBV{k). 

• Let  ^0  = -^0  ^ i)  ^ s.t.  / 7^  i}.  Let  U {(i,  j)},  and  find 

lb  = LBq{Aq,Ai),  see  section  4.4. 

• If  lb  is  larger  than  or  equal  to  ub  then  go  to  step  2. 

• If  the  lower  bound  solution  is  integer,  then  ub  = lb,  and  go  to  step  2. 

• Create  an  active  node  as  follows:  BBN  = BBN  + 1;  BBAN  = BBAN  + 1; 

BBLB(BBN)  = lb;  BBP(BBN)  = +k;  choose  the 
branching  variable  bv  (see  section  4.5),  BBBV(BBN)  = bv; 

• BBCN  = BBN. 

• Repeat  step  1. 

Step  2:  Downbranch 

• Let  A;  = BBCN,  and  let  (i,j)  = BBBV{k). 

• Let  ^0  = ^0  ^ Let  Ai  = Ai,  and  find  lb  = LBe(Ao,Ai)  , see  section  4.4. 

• If  lb  is  larger  than  or  equal  to  ub  then  go  to  step  3. 

• If  the  lower  bound  solution  is  integer,  then  ub  = lb,  and  go  to  step  3. 

• Create  an  active  node  as  follows:  BBN  = BBN  + 1;  BBAN  = BBAN  + 1; 

^ Ao,Af^^  = Ai;  BBLB(BBN)  = lb;  BBP{BBN)  = -k;  choose  the 
branching  variable  bv  (see  section  4.5),  BBBV(BBN)  = bv; 

• BBCN  = BBN. 

• Go  to  step  1. 

Step  3 

• Let  k = BBP(BBCN). 

• If  k is  positive,  then  set  BBCN  to  k,  and  go  to  step  2. 

• If  A;  is  negative,  then  decrease  BBAN  by  one. 

• If  BBAN  — 0,  stop.  There  are  no  more  active  nodes,  and  the  best  known  integer 
solution  is  the  optimal  solution  with  value  ub. 


Repeat  step  3. 
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Figure  4.1.  A directed  grid  graph  with  unit-cost  edges. 


>4®  _ 0 
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integer 

Figure  4.2.  A branch  and  bound  tree  with  a starting  upper  bound  of  9. 


Example 

Here,  we  iUustrate  how  the  branch  and  bound  algoritlmi  works.  Consider  a imit-edge-cost  grid  graph  where  the 
arcs  of  the  corresponding  directed  graph  are  numbered  as  shown  in  Figure  4.1,  and  the  regular  nodes  are  the  nodes 
marked  by  squares.  If  apphed  to  this  grid  graph,  WPSPGH,  proposed  in  section  4.3,  yields  a Steiner  tree  of  length 
9.  Also,  node  11  is  chosen  as  the  Steiner  tree  root  node  according  to  the  rule  mentioned  in  section  4.4.  As  shown  in 
Figure  4.2,  the  branch  and  bound  algorithm  given  above  considered  one  node  in  the  branching  tree  before  finding  an 
optimal  solution  of  value  8.  The  operations  performed  at  the  branching  node  are  presented  below. 


node  1:  The  optimal  solution  of  CVq  at  this  node  is  X2  = 0.354, 074  = 0.354, = 0.646, X19  = = 1?^22  = 

0.646,3737  = IjiPao  = 1j3^26  = 1?^32  = 1-  The  branching  variable  associated  with  this  node  is  3722-  The  lower 
boimd  at  node  1 is  equal  to  8. 

The  upbranch  at  node  1 yields  an  integer  solution  of  value  8,  and  the  downbranch  yields  a solution  with  an  objective 
value  of  8.  Therefore  both  branches  are  pruned  (not  added  to  the  set  of  active  nodes),  and  the  search  tree  contains 
only  one  node. 
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4.7  Preliminary  Experimental  Results 

The  algorithm  is  written  in  FORTRAN  77,  and  it  has  been  implemented  on  a SPARC 
workstation.  At  every  branching  node,  the  algorithm  interfaces  with  subroutine  MINOSS  in 
MINOS  5.4  [107]  to  solve  the  linear  program  that  generates  the  lower  bound.  On  the  average, 
every  caU  for  MINOS  5.4  takes  about  1 CPU  second  on  the  SPARC  workstation  for  the  test 
problems  considered.  We  have  done  some  preliminary  testing  of  the  algorithm  on  average 
size  problems.  The  solutions  of  some  of  the  problems  used  in  the  test  are  reported  by  Beasley 
[9],  where  a Lagrangean  relaxation  techniques  are  apphed  to  a minimum-spanning-tree  based 
formulation,  along  with  the  number  of  branching  nodes  and  the  CPU  time.  To  compare 
our  preliminary  results  for  these  latter  problems  with  the  previous  ones  is  a difficult  matter. 
Although  our  algorithm  considered  a smaller  number  of  branching  nodes  as  compared  to 
Beasley’s  algorithm,  the  number  of  branching  nodes  is  not  a decisive  measurement  since 
the  lower  bound  techniques  are  different.  The  CPU  time  is  not  relevant  either  because 
the  previous  results  were  obtained  on  a Cray  X-MP/48  with  vector  processing  capability. 
However,  we  may  conclude  that  our  algorithm  is  robust,  and  it  finds  the  optimal  solution 
in  a reasonable  amount  of  time.  Furthermore,  one  problem  from  the  set  of  grid-graph  test 
problems  given  by  Soukup  and  Chow  [147],  problem  15  in  Table  4.1,  for  which  Beasley’s 
algorithm  did  not  find  an  optimal  solution  [11],  can  be  solved  using  our  algorithm  with 
250357  branching  nodes. 

In  Table  4.2,  we  give  the  results  corresponding  to  some  problems  from  sets  steinc  and 
steinb  obtained  from  Beasley’s  OR- library  [10].  As  we  can  see  the  problem  size  is  dra- 
matically reduced  after  applying  reduction  techniques.  And  the  reduced-size  problems  can 
be  solved  with  a small  number  of  branching  nodes.  In  Table  4.3  the  optimal  solutions  of 
problems  generated  by  the  test  problem  generator  presented  in  chapter  5 are  verified  with 


a small  number  of  iterations. 
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4.8  Conclusions 

In  this  chapter,  we  have  presented  a branch-and-bound  algorithm  for  the  SPG.  This 
algorithm  uses  an  integer  formulation  presented  in  chapter  2 to  develop  lower  bounds,  and 
it  uses  a heuristic  presented  in  chapter  3 to  obtain  a starting  feasible  solution  which  provides 
an  upper  bound.  From  the  preliminary  results  we  have  obtained,  the  algorithm  is  robust 
and  compares  favorably  to  the  state  of  the  art. 
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Table  4.1.  Performance  on  SPRG  problems. 


Prob.  ^ 

n 

opt. 

bran,  node  ^ 

1 

5 

1.87 

3 

2 

6 

1.64 

1 

9 

7 

1.64 

0 

15 

14 

1.48 

250357 

17 

10 

2.0 

3206 

Table  4.2.  Performance  on  SPG  problems  from  sets  steinb  and  steinc. 


bef.  red. 

aft.  red. 

Prob.  ^ 

\Af\ 

1^1 

|7^| 

|AT| 

Ml 

1^1 

opt. 

bran,  node  ^ 

bi 

50 

63 

9 

10 

16 

4 

82 

0 

b2 

50 

63 

13 

14 

25 

7 

83 

0 

b3 

50 

63 

25 

8 

12 

4 

138 

0 

cl 

500 

625 

5 

150 

274 

5 

85 

0 

c4 

500 

625 

125 

149 

272 

57 

1079 

8 

Table  4.3.  Performance  on  problems  of  set  spgtp. 


Prob. 

AT 

A 

opt. 

bran,  node  ^ 

spgtp 1 

20 

40 

5 

986 

r 

spgtp2 

20 

40 

5 

3562 

0 

spgtp3 

20 

40 

7 

2213 

0 

spgtp4 

20 

40 

8 

2864 

0 

spgtp5 

20 

40 

10 

2776 

0 

CHAPTER  5 


A TEST  PROBLEM  GENERATOR  FOR  THE  STEINER  PROBLEM  IN  GRAPHS 

5.1  Introduction 

With  the  increasing  need  to  develop  good  heuristics  for  the  Steiner  problem  in  graphs 
(SPG),  standard  testing  procedures,  to  compare  those  heuristics  in  terms  of  performance, 
have  become  of  importance.  Testing  effectively  the  performance  of  these  heuristics,  how- 
ever, requires  a wide  variety  of  test  problems  with  known  optimal  solutions.  Hence,  the 
construction  of  test  problem  generators  becomes  crucial.  Test  problem  generators  for  in- 
teger programming  problems  are  difficult  to  construct,  and  the  construction  often  requires 
subtlety  and  deep  insight  into  the  problem  being  considered,  see  Pardalos  [113,  112]  and 
Sanchis  [133]  for  examples.  A very  popular  technique  used  in  the  design  of  test  problem 
generators  for  mathematical  programming  is  based  on  the  Karush-Kuhn- Tucker  (KKT)  op- 
timality conditions.  In  fact,  the  first  test  problem  generator  for  nonlinear  programming  uses 
the  KKT  conditions,  and  it  was  proposed  by  Rosen  and  Suzuki  [131]  in  1965.  If  extended 
to  combinatorial  optimization  problems,  the  KKT-condition  technique  could  be  used  in 
constructing  test  problem  generators  for  binary  programming  problems  where  the  corre- 
sponding relaxed  problems,  which  are  obtained  by  relaxing  the  integrahty  constraints,  are 
convex  optimization  problems.  On  the  other  hand,  test  problems  generated  using  the  KKT 
conditions  for  binary  programming  problems  can  be  trivial  depending  on  the  alternative 
formulation  of  the  problem.  For  instance,  if  apphed  to  the  network  flow  problem  formula- 
tion of  the  SPG,  formulation  T\  given  in  chapter  2,  a KKT-technique  generator  constructs 
trivial  test  problems  (trivial  in  the  sense  that  a heuristic  wiU  find  the  solution). 

In  section  5.2,  we  present  a modified  version  of  the  new  binary  programming  formulation, 
formulation  presented  in  chapter  2 where  the  integrality  constraints  are  replaced  with 
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quadratic  constraints,  and  the  constraint  that  eliminates  cycles  is  expressed  mathematically. 
Using  this  modified  formulation,  a test  problem  generator  for  the  SPG  is  proposed.  Also, 
we  show  how  to  construct  nontrivial  test  problems,  and  we  illustrate  the  construction  by  a 
smaU  example.  In  section  5.3,  we  discuss  some  experimental  results,  and  we  give  conclusions 
in  section  5.4. 

5.2  Generating  Nontrivial  Test  Problems  for  the  Steiner  Problem  in  Graphs 

In  this  section,  we  present  an  SPG  test  problem  generator  based  on  the  KKT  optimality 
conditions.  Also,  we  illustrate  by  an  example  how  to  construct  nontrivial  test  problems. 
Let  G — (A/*,  ^,C)  be  an  undirected  graph,  and  let  = (A/*,  be  its  corresponding 

directed  symmetric  graph  obtained  as  mentioned  in  chapter  2.  Furthermore,  let  IZ  (the 
set  of  regular  nodes)  be  a subset  of  A/^,  and  let  r (the  root  node)  be  an  arbitrary  node 
in  TZ.  In  addition,  let  0(i)  and  X(i)  be  subsets  of  arcs  that  are  going  out  of  node  i and 
into  node  i respectively.  Consider  a variant  form  of  formulation  given  in  chapter  2, 
obtained  by  replacing  the  integrality  constraints  by  quadratic  constraints  and  representing 
mathematically  the  cycle  constraint  as  follows: 


minimize  ^ CijXij 

(5.1) 

subject  to:  ^ X{j 

= 1 

: Wj 

(Vj  e 7^\{r}) 

(5.2) 

^ ^ ^ki 

^ X {j 

: Uij 

(V(i,j)eA‘^\0(r)) 

(5.3) 

Vi  - Vj  + W\xij 

1 

VI 

(V(i,j)eA‘^) 

(5.4) 

1 

. 

= 0 

■ Vij 

(V(iJ)eA^) 

(5.5) 

Vi 

> 0 

• 

(Vi  e V). 

(5.6) 

Here,  constraint  (5.5)  is  equivalent  to  X{j  G {0,1}.  Involving  additional  nonnegative  vari- 
ables corresponding  to  nodes,  constraint  (5.4)  eliminates  directed  cycles,  which  correspond 
to  tours  and  subtours;  this  type  of  constraint  is  used  in  formulating  the  traveling  salesman 
problem  [109,  95]. 
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Applied  to  the  above  formulation,  the  KKT  optimality  conditions  are  used  next,  in 
order  to  generate  test  problems  with  known  optimal  solutions  for  the  SPG.  Given  a certain 
network  structure  where  arc  costs  are  not  known,  the  idea  consists  of  choosing  a directed 
Steiner  tree,  having  node  r as  root  node,  and  setting  the  costs  of  arcs  used  in  this  tree 
to  arbitrary  positive  values;  the  costs  of  arcs  whose  reverse  arcs  are  used  in  the  tree  are 
fixed  by  this  process  since  Cij  should  be  equal  to  Cji.  Then,  by  applying  the  KKT  sufficient 
conditions  for  a global  optimum  (Bazaraa  and  Shetty  [7],  p.  147,  th.  4.3.7.),  we  determine 
the  costs  of  the  remaining  non-tree  arcs. 

Let  Aj  and  A/r  be  the  set  of  arcs  and  nodes,  respectively,  used  in  an  arbitrarily  chosen 
directed  Steiner  tree  T,  with  respect  to  7^  C on  ==  (A/*,  that  has  node  r as 

root  node.  Also,  let  variables  yi  be  chosen  in  the  following  way: 

0 < Vi  < |AT|  - l,Vi  G AT\{r}. 

b)  yj  — yi  = 1 + where  0 < c < {\J\f\  — 1)“^,  if  arc  (i,j)  is  in  the  directed  Steiner  tree 

chosen  above. 

c)  yr  = €. 

Then,  for  T to  be  a minimum  Steiner  tree,  the  KKT  global  optimality  conditions  are  given 
by  a system  of  linear  inequalities  and  equalities  involving  arc  costs  and  dual  variables  as 


follows: 


61 


r 

Cij  = -Uij  + Wj  - (2xij  - l)vij  + ^ Ujk  'i{i,j)eA‘^ 

keO{j)\{(j,i)} 


Urj  = 0 

U{j  — 0 


Uij  > 0 

Vij  > 0 


V(r,  j)  € O^r) 

if  (i  e A/'r)AND 

Vi  e M\{n\{r}} 

v(i,j)eA^ 

v{i,j)eA^ 

v(i,i)  G Ai 

v(i,i)e  aVt- 


Note  that  with  the  feasible  choice  of  m made  earlier,  constraints  (5.4)  and  (5.6)  are  never 
binding.  As  a result,  the  corresponding  dual  variables  are  omitted  from  the  KKT  system 
given  above  since  their  values  and  the  components  of  the  gradient  of  the  objective  function 
corresponding  to  variables  yi  are  null.  Also,  observe  that  if  an  arc  Xij{i  G ^\{^})  is  used 
in  T,  then  its  corresponding  dual  variable  U{j  and  the  dual  variable  uji  corresponding  to 
its  reverse  arc  are  allowed  to  be  positive.  Based  on  this  observation  and  given  positive 
costs  for  arcs  used  in  the  tree,  we  can  determine  nonnegative  values  for  the  dual  variables 
corresponding  to  tree  arcs  and  their  reverse  non-tree  arcs  by  starting  from  the  root  node 
r toward  the  leaf  nodes.  Then,  the  costs  of  the  remaining  non-tree  arcs  are  determined  as 
functions  of  the  dual  variables  that  are  already  determined  and  arbitrarily  chosen  positive 
values  for  the  dual  variables  that  have  not  been  determined  yet.  A pseudocode  for  this 
procedure  is  presented  below: 
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Procedure:  GENERATE-SPGTP 

Input:  T (directed  Steiner  tree  rooted  at  r G IZ,  Cj>  (set  of  costs  for  arcs  G T). 

Output:  An  SPG  test  problem. 

BEGIN  PROCEDURE 

Wr  = 0;  PREDECESSOR(r)=0;  Urk  = 0(VA:  C {0}  U {k,  (r,k)  E -^^});  J :=  {r}; 

(**detemune  the  dual  variables  Wi  for  nodes  in  the  tree,  and  u{j  and  vij  for  arcs  such  that  either 
they  or  their  reverse  arcs  are  in  the  tree.**) 

WHILE  J ^H}DO  BEGIN 
j :=  AN  ELEMENT  IN  J; 
i :=  PREDECESSOR(  J)  (unique  in  T); 

^ •=  (i,^)  C 

SUMl  :=  E '^jk  5 

k£K, 

FORALL  A:  G /C  DO  BEGIN 
= RAN  DOM  > 0; 

SUN! "2  . — ^jk  "b  '^jk  d”  '^jk't 

IF  ken  THEN  BEGIN 

ALLOCATE  RANDOMLY  SUM2  TO  Wk  AND  Ukii^l  G {/,  [k,l)  G A^}); 

END 

ELSE  BEGIN 

Wk  :=  0; 

ALLOCATE  RANDOMLY  SUM2  TO  Uki(^l  G {/,  (A:,/)  G A^}); 

END; 

Vkj  :=  RANDOM  > max{0,c/ej  ~ ~ — SUM\  T Ujk}\ 

'^kj  •=  -Ckj  + ujj  + uji  + SUMl  - Ujk  + Vkj; 

END; 

C^UX:)\{j}; 

END; 

(**determine  the  dual  variables  and  costs  for  the  rest  of  the  arcs.**) 

FORALL  (i,j)  IN  A^\A^  DO  BEGIN 

IF  i G ^^T  THEN  BEGIN 
uij  :=  0;  vij  :=  RANDOM  > 0; 

END 

ELSE  BEGIN  uij  :=  RANDOM  > 0;  v^J  :=  RANDOM  > 0; 

END; 

END; 

FORALL  (i,j)  IN  {(^j),  (i  < j)AND({{i,j),(j,i)}nA^  = 0)}  DO  BEGIN 

Cl  :=  -Uij  + Wj  -\-Vtj  + Ujk\ 

C2  :=  —Uji  4-  + Vji  + Uik; 

IF  Cl  < C2  THEN  BEGIN 

cij  :=  C2;  Cji  :=  C2;  Vtj  :=  Vtj  + C2  - ci; 

END 

ELSE  BEGIN 

Cij  :=  ci;  Cji  :=  ci ; Vji  :=  vji  + ci  - C2; 

END; 

END; 

END  PROCEDURE 


Example 

Next,  we  illustrate  by  a small  example  how  the  procedure  described  above  works. 
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Input  (see  Figure  5.1): 

• a graph  as  shown  in  Figure  5.1,  with  Af  = {l,  2, . . . , 9}  , 7Z  = {1, 2, 6,  7, 9}  and  22  directed  arcs  along  with 
their  corresponding  reverse  arcs  . 

• node  1 as  root  node,  T = {(1,3),  (3, 4),  (4, 2),  (2,  7),  (4, 6),  (4, 9)}. 

• C13  = C31  = 5,  C34  = C43  = 1,  C42  = C24  = 6,  C27  = C72  = 1 , C46  = C64  = 1 , C49  = C94  = 1. 

Find  the  costs  of  arcs  such  that  neither  they  nor  their  corresponding  reverse  arcs  are  in  the  tree  in  a way  that  T 
with  its  associated  arc  costs  is  a minimum  spanning  tree  by  applying  GENERATE-SPGTP.  First,  the  dual  variables 
corresponding  to  arcs  such  that  either  they  or  their  corresponding  reverse  arcs  are  in  the  tree  are  determined  in  the 
following  sequence: 


(j=l,k=3) 

(j=3,k=4) 

(j=4,k=2) 

(j=4,k=6) 

(j=4,k=9) 

(j=2,k=7) 


i;i3  = 0 -H- 

^34  = 0 — »■  (W4, 

V42  = 0 — >•  (W2 

V46  = 0 (ills 

V49  = 0 — f (k;9 

1>27  = 0 — )•  (WY 


= 0;w34  = 5);  vsi  = 5 —*■  U31  = 0; 

= 0;u42  = 0;ti46  = 0;n49  = 6);  V43  = 1 — »■  U43  = 0; 
= b;n27  = 0);  i;24  = 0 — »■  ^24  = 0; 

— 1),  1^64  — b >•  t/'04  — 3 5 

= 7);  V94  = 1 ti94  = 0; 

= 1);  V72  = 0 — »■  U72  = 5; 


Second,  uij,  Vij  for  arcs  such  that  neither  they  nor  their  corresponding  reverse  arcs  are  in  the  tree  are  set  as  follows: 
^^12  = '1^21  = 0,V12  = 1,V21  = 7;  Ui5  = W51  = 0,i;i5  = 0,^51  = 4;  W14  = U41  = 0,^14  = 0,^41  = 6; 

ni6  = Uqi  = 0,i>61  = 7,  Vie  = 1 ^25  = 0,^52  = 3,  V25  = 2,V52  = 0;  U23  = V32  = 0,V23  = l,'l^32  = Oj 

'^35  = 0,'f^53  = 1,'^35  = 1,'^53  = Oj  '^•36  = "^63  = 0,^36  ==  0,  V63  = 1;  U37  = V73  = 0,  V37  = 0,V73  = Ij 

^58  = ^85  = 0,^58  = 0,V85  = 1;  Uqj  = uye  = 0,  Ve7  = 1,1^76  = 1|  ^^68  = ^^86  = 0,  Ve8  = 1,^86  = 0; 

1^69  = ^96  = 0,  ”1^69  = 0,^96  = Ij  ^78  ==  0,”l^87  = 2,V78  = 1,^^87  — Oj  ^79  = ^97  = 0,^79  = 0,  ^^97  = Ij 

^89  = 3,  V.98  = 0,  V89  = 0,  V98  = 2.  And  the  Cij  for  those  arcs  are  calculated  as  follows: 

^12  = C21  = 7;  ci5  = C51  = 4;  C14  = C41  = 6;  cie  = cei  = 7;  C25  = C52  = 3;  C23  = C32  = 6;  C35  = C53  = 4; 

^36  = C63  = 6;  C37  = C73  = 6;  C58  = C85  = 5;  ce7  = C76  = 7;  ce8  = C8e  = 6;  059  = C96  = 7;  C78  = c^y  = 4; 

C79  = C97  = 7;  C89  = C98  = 4. 


Output  (see  Figure  5.2): 

• a graph  G,  as  shown  in  Figure  5.2,  with  AT  = {1, 2, . . . , 9}  , 7^  = {1,2, 6, 7, 9}  and  22  imdirected  arcs,  where 
the  minimmn  spanning  tree  for  11  is  of  length  equal  to  15. 

• C13  = 5,C34  = 1,C24  = 6,C27  = 1,C46  = 1,C49  = l,Cl2  = 7,  C15  = 4,  C14  = 6,  Cie  = 7,  C25  = 3,  C23  = 6,  C35  = 4, 
C36  = 6,  C37  = 6,  C58  = 5,  C67  = 7,  C68  = 6,  Ce9  = 7,  C78  = 4,  C79  = 7,  C89  = 4. 


Note  that  if  the  minimum  spanning  tree  heuristic  (using  the  complete  distance  graph  cor- 
responding to  TZ)  is  applied  to  this  problem,  then  the  approximate  solution  found  is  equal 
to  17. 


5.3  Experimental  Results 

As  we  can  see  from  Figure  5.2,  the  generator  discussed  in  the  previous  section  can  con- 
struct nontrivial  test  problems  for  which  heuristics  Uke  the  minimum  spanning  tree  heuristic 
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Figure  5.1.  A network  structure  and  a Steiner  tree. 


Figure  5.2.  A test  problem  for  the  SPG  with  an  optimal  solution  equal  to  15. 
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cannot  find  the  optimal  solution.  The  test  problem  generator  has  been  implemented  using 
FORTRAN  77.  A selected  set  of  test  problems  has  been  generated  using  this  implemen- 
tation. Tables  5.1,  5.2,  5.3,  5.4  and  5.5  represent  five  SPG  test  problems  where,  for  the 
sake  of  simplicity,  the  number  of  vertices  in  the  network  is  fixed  at  20,  and  the  number  of 
arcs  is  fixed  at  40.  However,  the  structure  of  the  network,  the  set  of  regular  vertices  TZ  and 
the  cost  range  are  variable.  For  each  problem,  the  optimal  solution  is  given  along  with  the 
approximate  solution  found  by  the  minimum  spanning  tree  heuristic,  when  applied  to  the 
complete  distance  graph  corresponding  to  TZ,  The  network  is  represented  by  a table  where 
column  1 entries  are  arc  numbers.  Column  2 and  column  3 give  the  two  end  nodes  of  an 
arc.  Arc  costs  are  represented  in  column  4.  As  shown  in  tables  5.1,  5.2,  5.3,  5.4  and  5.5,  the 
range  of  the  relative  error  on  approximating  the  exact  solution  to  the  SPG  by  a solution 
obtained  from  the  minimum  spanning  heuristic  is  from  0 to  20  percent. 

5.4  Conclusions 

In  this  chapter,  we  have  presented  a KKT-based  generator  to  construct  test  problems 
for  the  SPG  in  linear  time.  In  this  generator,  the  KKT  system  corresponding  to  the  mixed 
integer  formulation  proposed  in  section  5.2  is  simplified  by  an  appropriate  feasible  choice 
of  the  continuous  variables.  As  we  have  concluded,  the  generator  can  construct  nontrivial 
test  problems  for  which  heuristics  like  the  minimum  spanning  tree  heuristic  cannot  find  the 
optimal  solution.  The  quality  of  these  test  problems  has  been  established  with  the  minimum 
spanning  tree  heuristic  applied  to  the  complete  distance  graphs  corresponding  to  subsets 
of  regular  nodes.  However,  the  test  problem  generator  is  yet  to  be  assessed  by  measuring 
the  performance  of  other  known  heuristics  on  generated  test  problems.  Along  another  line, 
better  generators  can  be  devised  by  adopting  a sophisticated  systematic  way  of  finding  a 
feasible  choice  for  the  continuous  variables  in  the  mixed  integer  formulation  or  by  using 
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other  formulations  with  more  valid  cuts.  However,  this  can  be  expensive  for  the  generator 
from  the  complexity  viewpoint. 
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Table  5.1:  Test  problem  spgtpl 

of  nodes:  20 
of  arcs:  40 
n = {2,3,7,8,18} 

Minimum  Steiner  tree  set  of  arcs  = {8,15,22,29,31,38} 

Minimmn  Steiner  tree  cost:  986 

Steiner  tree  foimd  by  the  minimum  spanning  tree  hemistic  = {8, 15, 22, 29,31,38} 
Cost  found  by  the  minimmn  spanning  tree  hemistic:  986 


Arc 

Endl 

End2  # 

Arc  Cost 

1 

11 

10 

289 

2 

2 

6 

275 

3 

1 

15 

347 

4 

11 

13 

109 

5 

3 

5 

110 

6 

15 

12 

161 

7 

11 

9 

275 

8 

8 

11 

252 

9 

12 

19 

354 

10 

13 

17 

241 

11 

19 

20 

180 

12 

2 

14 

228 

13 

19 

16 

202 

14 

10 

4 

116 

15 

8 

1 

188 

16 

1 

17 

314 

17 

12 

18 

190 

18 

9 

10 

159 

19 

5 

4 

227 

20 

18 

19 

190 

21 

10 

8 

319 

22 

11 

7 

123 

23 

16 

12 

234 

24 

17 

6 

222 

25 

15 

3 

168 

26 

13 

6 

211 

27 

9 

17 

134 

28 

10 

5 

160 

29 

1 

2 

101 

30 

14 

10 

179 

31 

1 

3 

215 

32 

18 

16 

238 

33 

13 

2 

313 

34 

15 

8 

336 

35 

6 

19 

124 

36 

9 

3 

175 

37 

7 

16 

176 

38 

3 

18 

107 

39 

5 

6 

238 

40 

9 

18 

260 
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Table  5.2:  Test  problem  spgtp2 

# of  nodes:  20 

# of  arcs:  40 

7^  = {2,5,6,7,16} 

Minimum  Steiner  tree  set  of  arcs  = {8, 15, 19, 24, 26, 32, 36, 38} 

Minimum  Steiner  tree  cost:  3562 

Steiner  tree  found  by  the  minimmn  spanning  tree  hemistic  = {8, 15, 17, 21,26, 29, 32, 38} 
Cost  found  by  the  minimum  spanning  tree  heuristic:  4130 


Arc# 

Endl 

End2 

Arc  Cost 

1 

6 

14 

576 

2 

6 

18 

897 

3 

2 

10 

720 

4 

2 

8 

495 

5 

16 

15 

947 

6 

7 

13 

348 

7 

12 

19 

547 

8 

16 

11 

442 

9 

2 

1 

612 

10 

17 

20 

553 

11 

11 

4 

1187 

12 

17 

3 

307 

13 

1 

6 

1211 

14 

2 

12 

1459 

15 

11 

9 

153 

16 

8 

5 

1244 

17 

16 

5 

790 

18 

8 

17 

1032 

19 

11 

17 

830 

20 

9 

10 

1238 

21 

5 

15 

509 

22 

8 

4 

760 

23 

1 

18 

213 

24 

11 

12 

103 

25 

13 

4 

336 

26 

9 

6 

223 

27 

11 

14 

512 

28 

9 

3 

763 

29 

15 

17 

644 

30 

14 

9 

534 

31 

8 

6 

1263 

32 

17 

7 

524 

33 

6 

11 

496 

34 

17 

13 

1438 

35 

20 

4 

740 

36 

12 

5 

442 

37 

10 

8 

752 

38 

6 

2 

845 

39 

1 

17 

770 

40 

9 

4 

1301 
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Table  5.3:  Test  problem  spgtp3 

of  nodes:  20 
# of  arcs:  40 

7^  = {3,6,10,13,14,16,18} 

Minimum  Steiner  tree  set  of  arcs  = {5, 11, 19, 22,  27,31,33,34,37,40} 

Minimum  Steiner  tree  cost:  2213 

Steiner  tree  found  by  the  minimum  spanning  tree  heirristic  = {5, 10, 19, 26, 27, 32, 33, 37, 40} 
Cost  found  by  the  minimum  spanning  tree  heuristic:  2357 


Arc# 

Endl  # 

End2  # 

Arc  Cost 

1 

6 

15 

417 

2 

6 

1 

427 

3 

13 

7 

181 

4 

10 

4 

476 

5 

16 

11 

248 

6 

15 

8 

248 

7 

11 

5 

580 

8 

8 

2 

170 

9 

2 

19 

293 

10 

16 

20 

326 

11 

11 

9 

424 

12 

5 

2 

325 

13 

9 

7 

522 

14 

4 

16 

560 

15 

6 

8 

488 

16 

16 

8 

482 

17 

9 

4 

248 

18 

15 

9 

376 

19 

11 

17 

101 

20 

3 

1 

448 

21 

18 

11 

620 

22 

11 

12 

154 

23 

5 

13 

167 

24 

14 

8 

279 

25 

3 

15 

439 

26 

16 

14 

637 

27 

11 

6 

215 

28 

9 

8 

394 

29 

4 

6 

549 

30 

9 

18 

660 

31 

9 

14 

107 

32 

13 

20 

316 

33 

17 

3 

105 

34 

12 

13 

450 

35 

4 

12 

534 

36 

9 

1 

123 

37 

6 

10 

223 

38 

17 

9 

589 

39 

4 

7 

435 

40 

14 

18 

182 
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Table  5.4:  Test  problem  spgtp4 

of  nodes:  20 
# of  arcs:  40 

7^  = {2,8,10,13,14,15,16,18} 

Minimum  Steiner  tree  set  of  arcs  = {1,4,6,8,9,13, 15,20,21,23,29,35,37} 

Minimum  Steiner  tree  cost:  2864 

Steiner  tree  found  by  the  minimmn  spanning  tree  heuristic  = {2,3,4,  7,8,9, 10, 11, 12,13, 15,20,35} 
Cost  foimd  by  the  minimum  spanning  tree  hemistic:  3230 


Arc# 

Endl 

End2 

Arc  Cost 

1 

16 

11 

469 

2 

9 

4 

260 

3 

16 

4 

88 

4 

11 

12 

215 

5 

1 

16 

469 

6 

11 

5 

107 

7 

15 

17 

319 

8 

17 

14 

450 

9 

12 

18 

223 

10 

13 

4 

400 

11 

6 

13 

372 

12 

8 

4 

363 

13 

2 

15 

152 

14 

1 

18 

472 

15 

6 

10 

182 

16 

17 

19 

305 

17 

14 

1 

339 

18 

17 

20 

306 

19 

11 

7 

267 

20 

11 

9 

101 

21 

11 

17 

154 

22 

17 

3 

274 

23 

5 

8 

355 

24 

2 

10 

663 

25 

16 

6 

420 

26 

8 

17 

404 

27 

9 

10 

658 

28 

5 

16 

290 

29 

11 

6 

289 

30 

8 

3 

373 

31 

11 

15 

277 

32 

14 

9 

865 

33 

8 

6 

448 

34 

6 

9 

400 

35 

9 

2 

105 

36 

17 

10 

651 

37 

5 

13 

279 

38 

20 

4 

84 

39 

10 

11 

661 

40 

9 

5 

289 
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Table  5.5:  Test  problem  spgtp5 

of  nodes:  20 
# of  arcs:  40 

= {1,3,4,5,6,10,16,17,18,19} 

Minimum  Steiner  tree  set  of  arcs  = {3, 8, 13, 22, 26, 27, 28, 32, 33, 35, 36, 37} 

Minimum  Steiner  tree  cost:  2776 

Steiner  tree  found  by  the  minimum  spanning  tree  hemistic  = {8, 13, 22, 26, 27, 28, 32, 33, 35, 36, 37} 
Cost  foimd  by  the  minimum  spanning  tree  heuristic:  2790 


Arc# 

Endl  # 

End2  # 

Arc  Cost 

1 

19 

12 

274 

2 

6 

11 

167 

3 

4 

2 

252 

4 

16 

15 

430 

5 

1 

7 

348 

6 

11 

9 

335 

7 

2 

20 

252 

8 

4 

13 

123 

9 

17 

14 

130 

10 

7 

17 

239 

11 

2 

9 

433 

12 

7 

11 

354 

13 

4 

8 

101 

14 

16 

3 

569 

15 

15 

8 

468 

16 

18 

2 

260 

17 

14 

13 

411 

18 

14 

16 

342 

19 

15 

4 

492 

20 

16 

8 

385 

21 

12 

14 

403 

22 

2 

19 

154 

23 

14 

19 

392 

24 

17 

11 

476 

25 

13 

19 

266 

26 

2 

5 

289 

27 

2 

16 

252 

28 

13 

6 

431 

29 

15 

19 

524 

30 

15 

12 

286 

31 

15 

10 

113 

32 

13 

18 

135 

33 

8 

10 

351 

34 

7 

8 

416 

35 

19 

17 

223 

36 

10 

3 

147 

37 

17 

1 

318 

38 

9 

7 

151 

39 

16 

7 

463 

40 

2 

14 

380 

CHAPTER  6 


CONCLUSIONS  AND  FUTURE  RESEARCH 

In  this  study,  we  have  addressed  the  Steiner  problem  in  graphs  (SPG),  a problem  of 
great  interest  due  to  its  many  applications  in  different  areas  ranging  from  manufacturing 
systems  design  to  network  design.  We  have  approached  different  aspects  of  the  problem  by 
giving  new  formulations,  designing  a new  heuristic  and  a branch-and-bound  algorithm,  and 
devising  the  first  test  problem  generator  for  the  SPG.  Three  mathematical  formulations  for 
the  SPG  are  proposed  in  chapter  2.  Two  of  these  formulations  are  integer  formulations, 
and  the  third  formulation  is  a continuous  formulation  that  is  the  first  of  its  kind  opening 
the  horizon  of  treating  the  SPG  as  a continuous  combinatorial  problem,  a new  area  to  be 
explored  in  future  research. 

In  chapter  3,  a new  heuristic  for  the  SPG,  which  is  based  on  the  idea  of  associating 
preference  weights  with  arcs  to  express  the  influence  of  regular  nodes  on  making  the  choice 
between  arcs,  is  presented.  Different  weight  versions  have  been  tried.  The  experimental 
results  obtained  from  the  best  version  are  quite  encouraging.  Designing  different  weight 
versions  that  are  more  efficient  is  an  open  research  area. 

Using  one  of  the  proposed  integer  formulations,  namely  the  rooted-tree  formulation, 
an  exact  branch  and  bound  algorithm  is  presented  in  chapter  4.  The  algorithm  proves 
to  be  robust  and  relatively  efficient  based  on  prehminary  results  obtained  for  average-size 
problems.  Testing  the  performance  of  the  algorithm  with  a different  lower  bound  device 
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using  different  formulations  with  more  valid  cuts,  like  the  second  proposed  formulation,  or 
using  a different  LP  solver  is  yet  to  be  seen. 

In  chapter  5,  the  first  test  problem  generator  for  the  SPG  is  proposed.  This  generator 
uses  the  Karush-Kuhn- Tucker  optimality  conditions  of  the  rooted-tree  formulation  proposed 
in  chapter  2 to  construct  problems  with  known  solutions  in  linear  time.  From  the  experi- 
mental results,  the  constructed  problems  prove  to  be  moderately  hard.  The  quality  of  the 
generator  may  be  improved  by  adopting  different  formulations  with  more  valid  cuts.  How- 
ever, this  could  be  at  the  expense  of  computation  time.  Designing  a more  efficient  generator 
is  desired.  Not  only  does  designing  more  efficient  SPG  generators  based  on  the  same  idea 
constitute  an  avenue  of  research,  but  also  expanding  the  idea  of  the  generator  to  the  general 
subclass  of  integer  programming  where  the  only  source  of  nonconvexity  is  due  to  integer 
constraints  constitutes  an  area  of  research  by  itself. 

In  summary,  the  contribution  due  to  this  study  is  quite  established  based  on  originality 
and  good  experimental  results.  The  extensions  of  some  established  results  open  new  avenues 
of  research  making  the  impact  of  the  study  on  future  research  clear. 
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